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<120> infectious cDNA Clone of GB virus B and 
Uses Thereof 

<130> NIH257.001NP 

<140> US 10/009,002 
<141> 2003-01-14 
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<160> 13 

<170> FastSEQ for windows Version 4.0 

<210> 1 
<211> 9399 
<212> DNA 
<213> GBV-B virus 

<400> 1 

accacaaaca ctccagtttg ttacactccg ctaggaatgc tcctggagca ccccccctag 60 
cagggcgtgg gggatttccc ctgcccgtct gcagaagggt ggagccaacc accttagtat 120 
gtaggcggcg ggactcatga cgctcgcgtg atgacaagcg ccaagcttga cttggatggc 180 
cctgatgggc gttcatgggt tcggtggtgg tggcgcttta ggcagcctcc acgcccacca 240 
cctcccagat agagcggcgg cactgtaggg aagaccgggg accggtcact accaaggacg 300 
cagacctctt tttgagtatc acgcctccgg aagtagttgg gcaagcccac ctatatgtgt 360 
tgggatggtt ggggttagcc atccataccg tactgcctga tagggtcctt gcgaggggat 420 
ctgggagtct cgtagaccgt agcacatgcc tgttatttct actcaaacaa gtcctgtacc 480 
tgcgcccaga acgcgcaaga acaagcagac gcaggcttca tatcctgtgt ccattaaaac 540 
atctgttgaa aggggacaac gagcaaagcg caaagtccag cgcgatgctc ggcctcgtaa 600 
ttacaaaatt gctggtatcc atgatggctt gcagacattg gctcaggctg ctttgccagc 660 
tcatggttgg ggacgccaag accctcgcca taagtctcgc aatcttggaa tccttctgga 720 
ttaccctttg gggtggattg gtgatgttac aactcacaca cctctagtag gcccgctggt 780 
ggcaggagcg gtcgttcgac cagtctgcca gatagtacgc ttgctggagg atggagtcaa 840 
ctgggctact ggttggttcg gtgtccacct ttttgtggta tgtctgctat ctttggcctg 900 
tccctgtagt ggggcgcggg tcactgaccc agacacaaat accacaatcc tgaccaattg 960 
ctgccagcgt aatcaggtta tctattgttc tccttccact tgcctacacg agcctggttg 1020 
tgtgatctgt gcggacgagt gctgggttcc cgccaatccg tacatctcac acccttccaa 1080 
ttggactggc acggactcct tcttggctga ccacattgat tttgttatgg gcgctcttgt 1140 
gacctgtgac gcccttgaca ttggtgagtt gtgtggtgcg tgtgtattag tcggtgactg 1200 
gcttgtcagg cactggctta ttcacataga cctcaatgaa actggtactt gttacctgga 1260 
agtgcccact ggaatagatc ctgggttcct agggtttatc gggtggatgg ccggcaaggt 1320 
cgaggctgtc atcttcttga ccaaactggc ttcacaagta ccatacgcta ttgcgactat 1380 
gtttagcagt gtacactacc tggcggttgg cgctctgatc tactatgcct ctcggggcaa 1440 
gtggtatcag ttgctcctag cgcttatgct ttacatagaa gcgacctctg gaaaccccat 1500 
cagggtgccc actggatgct caatagctga gttttgctcg cctttgatga taccatgtcc 1560 
ttgccactct tatttgagtg agaatgtgtc agaagtcatt tgttacagtc caaagtggac 1620 
caggcctatc actctagagt ataacaactc catatcttgg tacccctata caatccctgg 1680 
tgcgagggga tgtatggtta aattcaaaaa taacacatgg ggttgctgcc gtattcgcaa 1740 
tgtgccatcg tactgcacta tgggcactga tgcagtgtgg aacgacactc gcaacactta 1800 
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cgaagcatgc 
attggctata 
aggccatttg 
ttccactctc 
tggttcttgg 
attgatcacc 
tttgtctctt 
cagcaagtat 
tggttaccct 
tttgtctaaa 
ctgcaggcta 
aactttcttt 
agtggcaggg 
aggtccttgg 
ttttgatacc 
gtctcgtttt 
ttggcaacgt 
tgtgctggtt 
cgtagctctt 
tagggcccat 
tcttaagttt 
tggtgatgtc 
ttttgaaggc 
ggttgatggt 
gccgccagat 
cacgctgtca 
tatcttcaga 
gtatactgct 
cccaataacc 
gtcccttact 
attggttgag 
ggctgttgcc 
gatgttcacc 
ggtgtgtgct 
tgtgcctaac 
caaattacca 
ggctacaaca 
ctattttaat 
gtacctgacc 
taccgatgca 
tgttaggcta 
caacataact 
taaggaggaa 
tgatgagctt 
atgtgacatc 
tacagggtac 
atgccatgtt 
aatagttaaa 
tgtagacggg 
cgacgcagcc 
ctatcgcacc 
cttttctatg 
ttatgttttg 
caatgacgca 
gcgcttggac 
aatgtgcttc 
ggctatggct 
tattacatca 
ggaggagtgt 
gagtacaatc 
ctttcttggg 
cactttacct 
cccactacct 



ggtgtaacac 
ttacaatacc 
tattttgagg 
ctaccaccgg 
ttacaggttc 
aaagacaaag 
acgggagtta 
cttattttag 
ttgcgtcctg 
gctcaagtag 
cgttatgctg 
gttgcagcag 
ttagttttgt 
cctctggtag 
gagataattg 
ggcttctttg 
tgggagaatt 
tgtttccccg 
ctatgtttaa 
agaatgttgg 
ttcctcttag 
ttgcctaatg 
aaggcaaggg 
ttgcccgttg 
gggtgggcca 
gcgatggcag 
ttaggatctc 
caccatggca 
gttgacgcgg 
cggtgctctt 
gtcaacaaat 
aagggttctt 
gctgctagaa 
ggataccatc 
gagtattcag 
ctttcttaca 
gcatcaatgc 
ggcaaatgta 
ggagcatgtt 
accaccgtgt 
gtggttcttg 
gagattcaat 
aatctgaaga 
gctaacgagt 
tcaaaaatcc 
actggtgact 
gaccttgacc 
ggccagcgta 
agttgtaccc 
aaggcatggt 
caacctgggt 
gtcaaccccg 
ttgactgcag 
ccacggtggc 
ggcgctgacg 
actgaagtca 
tatctagcca 
gtccctaccg 
gcatcattca 
accacaacta 
cctcatgcag 
gacaatccct 
cacaagatca 
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catggctaac aaccgcatgg cacaacggct cagccctgaa 1860 

ctgggtctaa agaaatgttt aaacctcata attggatgtc 1920 

gatcagatac ccctatagtt tacttttatg accctgtgaa 1980 

agaggtgggc taggttgccc ggtaccccac ctgtggtacg 2040 

cgcaagggtt ttacagtgat gtgaaagacc tagccacagg 2100 

cctggaaaaa ttatcaggtc ttatattccg ccacgggtgc 2160 

ccaccaaggc cgtggtgcta attctgttgg ggttgtgtgg 2220 

cctacctctg ttacttgtcc ctttgttttg ggcgcgcttc 2280 

tgctcccatc ccagtcgtat ctccaagctg gctgggatgt 2340 

ctccttttgc tttgattttc ttcatctgtt gctatctccg 2400 

cccttttagg gtttgtgccc atggctgcgg gcttgcccct 2460 

ctgctgccca accagattat gactggtggg tgcgactgct 2520 

gggccggccg taaccgtggt caccgcatag ctctgcttgt 2580 

cgcttttaac cctcttgcat ttggttacgc ctgcttcagc 2640 

gagggctgac aataccacct gtagtagcat tagttgtcat 2700 

ctcacttgtt acctcgctgt gctttagtta actcctatct 2760 

ggttttggaa cgttacacta agaccggaga ggtttttcct 2820 

gtgcgacata tgacgcgctg gtgactttct gtgtgtgtca 2880 

catccagtgc agcatcgttc tttgggactg actctagggt 2940 

tgcgtctcgg aaagtgtcat gcttggtatt ctcattatgt 3000 

tgtttggtga gaatggtgtg tttttctata agcacttgca 3060 

attttgcctc gaaactacca ttgcaagagc catttttccc 3120 

tctataggaa tgaaggaaga cgcttggcgt gtggggacac 3180 

ttgcgcgtct cggcgacctt gttttcgcag ggttggctat 3240 

ttaccgcacc ttttacgctg cagtgtctct ctgaacgtgg 3300 

tggtcatgac tggtatagac ccccgaactt ggactggaac 3360 

tggccactag ctacatggga tttgtttgtg acaacgtgtt 3420 

gcaaggggcg ccggttggct catcccacag gctctataca 3480 

ctaatgacca ggacatctat caaccaccat gtggagctgg 3540 

gcggggagac caaggggtat ctggtaacac gactggggtc 3600 

ccgatgaccc ttattggtgt gtgtgcgggg cccttcccat 3660 

caggtgcccc gattctgtgc tcctccgggc atgttattgg 3720 

attctggcgg ttcagtcagt cagattaggg ttaggccgtt 3780 

cccagtacac agcacatgcc actcttgata caaaacctac 3840 

tgcaaatttt aattgccccc actggcagcg gcaagtcaac 3900 

tgcaggagaa gtatgaggtc ttggtcctaa atcccagtgt 3960 

caaagtacat gcacgcgacg tacggcgtga atccaaattg 4020 

ccaacacagg ggcttcactt acgtacagca catatggcat 4080 

cccggaacta tgatgtaatc atttgtgacg aatgccatgc 4140 

tgggcattgg aaaggtccta accgaagctc catccaaaaa 4200 

ccacggctac cccccctgga gtaatcccta caccacatgc 4260 

taaccgatga aggcactatc ccctttcatg gaaaaaagat 4320 

aagggagaca ccttatcttt gaggctacca aaaaacactg 4380 

tagctcgaaa gggaataaca gctgtctctt actatagggg 4440 

ctgagggcga ctgtgtagta gttgccactg atgccttgtg 4500 

ttgattccgt gtatgactgc agcctcatgg tagaaggcac 4560 

ctactttcac catgggtgtt cgtgtgtgcg gggtttcagc 4620 

ggggccgcac aggccgtggg agagctggca tatactacta 4680 

cttcgggtat ggttcctgaa tgcaacattg ttgaagcctt 4740 

atggtttgtc atcaacagaa gctcaaacta ttctggacac 4800 

tacctgcgat aggagcaaat ttggacgagt gggctgatct 4860 

aaccttcatt tgtcaatact gcaaaaagaa ctgctgacaa 4920 

cccaactaca actgtgtcat cagtatggct atgctgctcc 4980 

agggagcccg gcttgggaaa aaaccttgtg gggttctgtg 5040 

cctgtcctgg cccagagccc agcgaggtga ccagatacca 5100 

atacttctgg gacagccgca ctcgctgttg gcgttggagt 5160 

ttgacacttt tggcgccact tgtgtgcggc gttgctggtc 5220 

gtgctactgt cgccccagtg gttgacgaag aagaaatcgt 5280 

ttcccttgga ggccatggtt gctgcaattg acaagctgaa 5340 

gtcctttcac attggaaacc gcccttgaaa aacttaacac 5400 

ctacaatcct tgctatcata gagtattgct gtggtttagt 5460 

ttgcatcatg cgtgtttgct ttcattgcgg gtattactac 5520 

aaatgttcct gtcattattt ggaggcgcaa ttgcgtccaa 5580 
Page 2 



2540109_1.TXT 

gcttacagac gctagaggcg cactggcgtt catgatggcc ggggctgcgg gaacagctct 5640 
tggtacatgg acatcggtgg gttttgtctt tgacatgcta ggcggctatg ctgccgcctc 5700 
atccactgct tgcttgacat ttaaatgctt gatgggtgag tggcccacta tggatcagct 5760 
tgctggttta gtctactccg cgttcaatcc ggccgcagga gttgtgggcg tcttgtcagc 5820 
ttgtgcaatg tttgctttga caacagcagg gccagatcac tggcccaaca gacttcttac 5880 
tatgcttgct aggagcaaca ctgtatgtaa tgagtacttt attgccactc gtgacatccg 5940 
caggaagata ctgggcattc tggaggcatc taccccctgg agtgtcatat cagcttgcat 6000 
ccgttggctc cacaccccga cggaggatga ttgcggcctc attgcttggg gtctagagat 6060 
ttggcagtat gtgtgcaatt tctttgtgat ttgctttaat gtccttaaag ctggagttca 6120 
gagcatggtt aacattcctg gttgtccttt ctacagctgc cagaaggggt acaagggccc 6180 
ctggattgga tcaggtatgc tccaagcacg ctgtccatgc ggtgctgaac tcatcttttc 6240 
tgttgagaat ggttttgcaa aactttacaa aggacccaga acttgttcaa attactggag 6300 
aggggctgtt ccagtcaacg ctaggctgtg tgggtcggct agaccggacc caactgattg 6360 
gactagtctt gtcgtcaatt atggcgttag ggactactgt aaatatgaga aaatgggaga 6420 
tcacattttt gttacagcag tatcctctcc aaatgtctgt ttcacccagg tgcccccaac 6480 
cttgagagct gcagtggccg tggacggcgt acaggttcag tgttatctag gtgagcccaa 6540 
aactccttgg acgacatctg cttgctgtta cggtcctgac ggtaagggta aaactgttaa 6600 
gcttcccttc cgcgttgacg gtcacacacc tggtgtgcgc atgcaactta atttgcgtga 6660 
tgcacttgag acaaatgact gtaattccac aaacaacact cctagtgatg aagccgcagt 6720 
gtccgctctt gttttcaaac aggagttgcg gcgtacaaac caattgcttg aggcaatttc 6780 
agctggcgtt gacaccacca aactgccagc cccctccatc gaagaggtag tggtaagaaa 6840 
gcgccagttc cgggcaagaa ctggttcgct taccttgcct ccccctccga gatccgtccc 6900 
aggagtgtca tgtcctgaaa gcctgcaacg aagtgacccg ttagaaggtc cttcaaacct 6960 
ccctccttca ccacctgttc tacagttggc catgccgatg cccctgttgg gagcgggtga 7020 
gtgtaaccct ttcactgcaa ttggatgtgc aatgaccgaa acaggcggag gccctgatga 7080 
tttacccagt taccctccca aaaaggaggt ctctgaatgg tcagacgaaa gttggtcgac 7140 
ggctacaacc gcttccagct acgttactgg ccccccgtac cctaagatac ggggaaagga 7200 
ttccactcag tcagcccccg ccaaacggcc tacaaaaaag aagttgggaa agagtgagtt 7260 
ttcgtgcagc atgagctaca cctggaccga cgtgattagc ttcaaaactg cttctaaagt 7320 
tctgtctgca actcgggcca tcactagtgg tttcctcaaa caaagatcat tggtgtatgt 7380 
gactgagccg cgggatgcgg agcttagaaa acaaaaagtc actattaata gacaacctct 7440 
gttcccccca tcataccaca agcaagtgag attggctaag gaaaaagctt caaaagttgt 7500 
cggtgtcatg tgggactatg atgaagtagc agctcacacg ccctctaagt ctgctaagtc 7560 
ccacatcact ggccttcggg gcactgatgt tcgttctgga gcagcccgca aggctgttct 7620 
ggacttgcag aagtgtgtcg aggcaggtga gataccgagt cattatcggc aaactgtgat 7680 
agttccaaag gaggaggtct tcgtgaagac cccccagaaa ccaacaaaga aacccccaag 7740 
gcttatctcg tacccccacc ttgaaatgag atgtgttgag aagatgtact acggtcaggt 7800 
tgctcctgac gtagttaaag ctgtcatggg agatgcgtac gggtttgtag atccacgtac 7860 
ccgtgtcaag cgtctgttgt cgatgtggtc acccgatgca gtcggagcca catgcgatac 7920 
agtgtgtttt gacagtacca tcacacccga ggatatcatg gtggagacag acatctactc 7980 
agcagctaaa ctcagtgacc aacaccgagc tggcattcac accattgcga ggcagttata 8040 
cgctggagga ccgatgatcg cttatgatgg ccgagagatc ggatatcgta ggtgtaggtc 8100 
ttccggcgtc tatactacct caagttccaa cagtttgacc tgctggctga aggtaaatgc 8160 
tgcagccgaa caggctggca tgaagaaccc tcgcttcctt atttgcggcg atgattgcac 8220 
cgtaatttgg aagagcgccg gagcagatgc agacaaacaa gcaatgcgtg tctttgctag 8280 
ctggatgaag gtgatgggtg caccacaaga ttgtgtgcct caacccaaat acagtttgga 8340 
agaattaaca tcatgctcat caaatgttac ctctggaatt accaaaagtg gcaagcctta 8400 
ctactttctt acaagagatc ctcgtatccc ccttggcagg tgctctgccg agggtctggg 8460 
atacaacccc agtgctgcgt ggattgggta tctaatacat cactacccat gtttgtgggt 8520 
tagccgtgtg ttggctgtcc atttcatgga gcagatgctc tttgaggaca aacttcccga 8580 
gactgtgacc tttgactggt atgggaaaaa ttatacggtg cctgtagaag atctgcccag 8640 
catcattgct ggtgtgcacg gtattgaggc tttctcggtg gtgcgctaca ccaacgctga 8700 
gatcctcaga gtttcccaat cactaacaga catgaccatg ccccccctgc gagcctggcg 8760 
aaagaaagcc agggcggtcc tcgccagcgc caagaggcgt ggcggagcac acgcaaaatt 8820 
ggctcgcttc cttctctggc atgctacatc tagacctcta ccagatttgg ataagacgag 8880 
cgtggctcgg tacaccactt tcaattattg tgatgtttac tccccggagg gggatgtgtt 8940 
tattacacca cagagaagat tgcagaagtt ccttgtgaag tatttggctg tcattgtttt 9000 
tgccctaggg ctcattgctg ttggattagc catcagctga acccccaaat tcaaaattaa 9060 
ctaacagttt tttttttttt tttttttttt agggcagcgg caacagggga gaccccgggc 9120 
ttaacgaccc cgccgatgtg agtttggcga ccatggtgga tcagaaccgt ttcgggtgaa 9180 
gccatggtct gaaggggatg acgtcccttc tggctcatcc acaaaaaccg tctcgggtgg 9240 
gtgaggagtc ctggctgtgt gggaagcagt cagtataatt cccgtcgtgt gtggtgacgc 9300 
ctcacgacgt atttgtccgc tgtgcagagc gtagtaccaa gggctgcacc ccggtttttg 9360 
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ttccaagcgg agggcaaccc ccgcttggaa ttaaaaact 



9399 



<210> 2 
<211> 2864 
<212> PRT 
<213> GBV-B virus 

<400> 2 
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930 935 940 

Leu Gin Cys Leu Ser Glu Arg Gly Thr Leu Ser Ala Met Ala val Val 
945 950 955 960 

Met Thr Gly lie Asp Pro Arg Thr Trp Thr Gly Thr lie Phe Arg Leu 

965 " 970 975 

Gly Ser Leu Ala Thr Ser Tyr Met Gly Phe val Cys Asp Asn val Leu 

980 985 990 

Tyr Thr Ala His His Gly Ser Lys Gly Arg Arg Leu Ala His Pro Thr 

995 1000 " 1005 

Gly Ser lie His Pro lie Thr Val Asp Ala Ala Asn Asp Gin Asp lie 

1010 1015 1020 

Tyr Gin Pro Pro Cys Gly Ala Gly Ser Leu Thr Arg Cys Ser Cys Gly 
1025 1030 1035 1040 

Glu Thr Lys Gly Tyr Leu Val Thr Arg Leu Gly Ser Leu val Glu val 

1045 1050 1055 

Asn Lys Ser Asp Asp Pro Tyr Trp Cys val Cys Gly Ala Leu Pro Met 

1060 1065 1070 

Ala val Ala Lys Gly Ser Ser Gly Ala Pro lie Leu Cys Ser Ser Gly 

1075 ' 1080 1085 

His Val lie Gly Met Phe Thr Ala Ala Arg Asn Ser Gly Gly Ser val 

1090 1095 " 1100 

Ser Gin lie Arg Val Arg Pro Leu Val Cys Ala Gly Tyr His Pro Gin 
1105 1110 1115 1120 

Tyr Thr Ala His Ala Thr Leu Asp Thr Lys Pro Thr val Pro Asn Glu 

1125 1130 1135 

Tyr Ser val Gin lie Leu lie Ala Pro Thr Gly Ser Gly Lys Ser Thr 

1140 1145 1150 

Lys Leu Pro Leu Ser Tyr Met Gin Glu Lys Tyr Glu Val Leu Val Leu 

1155 1160 ' 1165 

Asn Pro Ser Val Ala Thr Thr Ala Ser Met Pro Lys Tyr Met His Ala 

1170 1175 1180 

Thr Tyr Gly val Asn Pro Asn Cys Tyr Phe Asn Gly Lys Cys Thr Asn 
1185 1190 ' 1195 1200 

Thr Gly Ala Ser Leu Thr Tyr Ser Thr Tyr. Gly Met Tyr Leu Thr Gly 

1205 " 1210 1215 

Ala Cys Ser Arg Asn Tyr Asp val lie lie Cys Asp Glu Cys His Ala 

1220 1225 1230 

Thr Asp Ala Thr Thr Val Leu Gly lie Gly Lys val Leu Thr Glu Ala 

1235 1240 1245 

Pro Ser Lys Asn val Arg Leu val val Leu Ala Thr Ala Thr Pro Pro 

1250 1255 1260 

Gly val lie Pro Thr Pro His Ala Asn lie Thr Glu lie Gin Leu Thr 
1265 1270 1275 1280 

Asp Glu Gly Thr lie Pro Phe His Gly Lys Lys lie Lys Glu Glu Asn 

1285 1290 1295 

Leu Lys Lys Gly Arg His Leu lie Phe Glu Ala Thr Lys Lys His Cys 

1300 1305 1310 

Asp Glu Leu Ala Asn Glu Leu Ala Arg Lys Gly lie Thr Ala Val Ser 

1315 1320 ' 1325 

Tyr Tyr Arg Gly Cys Asp lie Ser Lys lie Pro Glu Gly Asp Cys val 

1330 1335 ' 1340 

val val Ala Thr Asp Ala Leu Cys Thr Gly Tyr Thr Gly Asp Phe Asp 
1345 1350 1355 1360 

Ser val Tyr Asp Cys Ser Leu Met val Glu Gly Thr Cys His val Asp 

1365 1370 1375 

Leu Asp Pro Thr Phe Thr Met Gly val Arg val Cys Gly val Ser Ala 

1380 1385 ' 1390 

lie Val Lys Gly Gin Arg Arg Gly Arg Thr Gly Arg Gly Arg Ala Gly 

1395 1400 " 1405 

lie Tyr Tyr Tyr val Asp Gly Ser Cys Thr Pro Ser Gly Met val Pro 

1410 1415 1420 

Glu Cys Asn lie val Glu Ala Phe Asp Ala Ala Lys Ala Trp Tyr Gly 
1425 1430 1435 1440 
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Leu Ser Ser Thr G"lu Ala Gin Thr lie Leu Asp Thr Tyr Arg Thr Gin 

1445 1450 1455 

Pro Gly Leu Pro Ala lie Gly Ala Asn Leu Asp Glu Trp Ala Asp Leu 

1460 1465 1470 

Phe Ser Met Val Asn Pro Glu Pro Ser Phe Val Asn Thr Ala Lys Arg 

1475 1480 1485 

Thr Ala Asp Asn Tyr val Leu Leu Thr Ala Ala Gin Leu Gin Leu Cys 

1490 1495 1500 

His Gin Tyr Gly Tyr Ala Ala Pro Asn Asp Ala Pro Arg Trp Gin Gly 
1505 1510 1515 ' 1520 

Ala Arg Leu Gly Lys Lys Pro Cys Gly val Leu Trp Arg Leu Asp Gly 

1525 1530 1535 

Ala Asp Ala Cys Pro Gly Pro Glu Pro Ser Glu val Thr Arg Tyr Gin 

1540 1545 1550 

Met Cys Phe Thr Glu val Asn Thr Ser Gly Thr Ala Ala Leu Ala Val 

1555 1560 1565 

Gly Val Gly val Ala Met Ala Tyr Leu Ala lie Asp Thr Phe Gly Ala 

1570 1575 1580 

Thr Cys Val Arg Arg Cys Trp Ser lie Thr Ser val Pro Thr Gly Ala 
1585 " 1590 1595 1600 

Thr Val Ala Pro val val Asp Glu Glu Glu lie val Glu Glu Cys Ala 

1605 1610 1615 

Ser Phe lie Pro Leu Glu Ala Met val Ala Ala lie Asp Lys Leu Lys 

1620 1625 1630 

Ser Thr lie Thr Thr Thr Ser Pro Phe Thr Leu Glu Thr Ala Leu Glu 

1635 1640 1645 

Lys Leu Asn Thr Phe Leu Gly Pro His Ala Ala Thr lie Leu Ala lie 

1650 1655 1660 

lie Glu Tyr Cys Cys Gly Leu Val Thr Leu Pro Asp Asn Pro Phe Ala 
1665 1670 1675 1680 

Ser Cys val Phe Ala Phe lie Ala Gly lie Thr Thr Pro Leu Pro His 

1685 1690 1695 

Lys lie Lys Met Phe Leu Ser Leu Phe Gly Gly Ala lie Ala Ser Lys 

1700 1705 1710 

Leu Thr Asp Ala Arg Gly Ala Leu Ala Phe Met Met Ala Gly Ala Ala 

1715 ~ 1720 1725 

Gly Thr Ala Leu Gly Thr Trp Thr Ser val Gly Phe val Phe Asp Met 

1730 1735 1740 

Leu Gly Gly Tyr Ala Ala Ala Ser Ser Thr Ala Cys Leu Thr Phe Lys 
1745 * 1750 1755 1760 

Cys Leu Met Gly Glu Trp Pro Thr Met Asp Gin Leu Ala Gly Leu Val 

1765 1770 1775 

Tyr Ser Ala Phe Asn Pro Ala Ala Gly Val val Gly val Leu Ser Ala 

1780 1785 1790 

Cys Ala Met Phe Ala Leu Thr Thr Ala Gly Pro Asp His Trp Pro Asn 

1795 1800 1805 

Arg Leu Leu Thr Met Leu Ala Arg Ser Asn Thr val Cys Asn Glu Tyr 

1810 1815 1820 

Phe lie Ala Thr Arg Asp lie Arg Arg Lys lie Leu Gly lie Leu Glu 
1825 1830 1835 1840 

Ala Ser Thr Pro Trp Ser Val lie Ser Ala Cys lie Arg Trp Leu His 

1845 1850 1855 

Thr Pro Thr Glu Asp Asp Cys Gly Leu lie Ala Trp Gly Leu Glu lie 

1860 1865 1870 

Trp Gin Tyr Val Cys Asn Phe Phe val lie Cys Phe Asn Val Leu Lys 

1875 1880 1885 

Ala Gly val Gin Ser Met val Asn lie Pro Gly Cys Pro Phe Tyr Ser 

1890 1895 1900 

Cys Gin Lys Gly Tyr Lys Gly Pro Trp lie Gly Ser Gly Met Leu Gin 
1905 1910 1915 1920 

Ala Arg Cys Pro Cys Gly Ala Glu Leu lie Phe Ser Val Glu Asn Gly 

1925 1930 1935 

Phe Ala Lys Leu Tyr Lys Gly Pro Arg Thr Cys Ser Asn Tyr Trp Arg 
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1940 1945 1950 

Gly Ala val Pro Val Asn Ala Arg Leu Cys Gly Ser Ala Arg Pro Asp 

1955 1960 1965 

Pro Thr Asp Trp Thr Ser Leu val val Asn Tyr Gly Val Arg Asp Tyr 

1970 1975 1980 

Cys Lys Tyr Glu Lys Met Gly Asp His lie Phe Val Thr Ala val Ser 
1985 1990 1995 2000 

Ser Pro Asn val Cys Phe Thr Gin Val Pro Pro Thr Leu Arg Ala Ala 

2005 2010 2015 

val Ala val Asp Gly val Gin val Gin Cys Tyr Leu Gly Glu Pro Lys 

2020 2025 2030 

Thr Pro Trp Thr Thr Ser Ala Cys Cys Tyr Gly Pro Asp Gly Lys Gly 

2035 2040 2045 

Lys Thr Val Lys Leu Pro Phe Arg Val Asp Gly His Thr Pro Gly val 

2050 2055 2060 

Arg Met Gin Leu Asn Leu Arg Asp Ala Leu Glu Thr Asn Asp Cys Asn 
2065 2070 2075 2080 

Ser Thr Asn Asn Thr Pro Ser Asp Glu Ala Ala val Ser Ala Leu Val. 

2085 2090 2095 

Phe Lys Gin Glu Leu Arg Arg Thr Asn Gin Leu Leu Glu Ala lie Ser 

2100 " 2105 2110 

Ala Gly val Asp Thr Thr Lys Leu Pro Ala Pro Ser lie Glu Glu Val 

2115 2120 2125 

val val Arg Lys Arg Gin Phe Arg Ala Arg Thr Gly Ser Leu Thr Leu 

2130 2135 2140 

Pro Pro Pro Pro Arg Ser Val Pro Gly Val Ser Cys Pro Glu Ser Leu 
2145 2150 2155 2160 

Gin Arg Ser Asp Pro Leu Glu Gly Pro Ser Asn Leu Pro Pro Ser Pro 

2165 2170 2175 

Pro Val Leu Gin Leu Ala Met Pro Met Pro Leu Leu Gly Ala Gly Glu 

2180 2185 2190 

Cys Asn Pro Phe Thr Ala lie Gly Cys Ala Met Thr Glu Thr Gly Gly 

2195 2200 2205 

Gly Pro Asp Asp Leu Pro Ser Tyr Pro Pro Lys Lys Glu Val Ser Glu 

2210 2215 2220 

Trp Ser Asp Glu Ser Trp Ser Thr Ala Thr Thr Ala Ser Ser Tyr val 
2225 2230 2235 2240 

Thr Gly Pro Pro Tyr Pro Lys lie Arg Gly Lys Asp Ser Thr Gin Ser 

2245 2250 2255 

Ala Pro Ala Lys Arg Pro Thr Lys Lys Lys Leu Gly Lys Ser Glu Phe 

2260 2265 2270 

Ser Cys Ser Met Ser Tyr Thr Trp Thr Asp Val lie Ser Phe Lys Thr 

2275 2280 2285 

Ala Ser Lys Val Leu Ser Ala Thr Arg Ala lie Thr Ser Gly Phe Leu 

2290 2295 2300 

Lys Gin Arg Ser Leu Val Tyr Val Thr Glu Pro Arg Asp Ala Glu Leu 
2305 ~ 2310 2315 2320 

Arg Lys Gin Lys val Thr lie Asn Arg Gin Pro Leu Phe Pro Pro Ser 

2325 2330 2335 

Tyr His Lys Gin val Arg Leu Ala Lys Glu Lys Ala Ser Lys Val val 

2340 " 2345 2350 

Gly Val Met Trp Asp Tyr Asp Glu val Ala Ala His Thr Pro Ser Lys 

2355 2360 2365 

Ser Ala Lys Ser His lie Thr Gly Leu Arg Gly Thr Asp Val Arg Ser 

2370 2375 2380 

Gly Ala Ala Arg Lys Ala val Leu Asp Leu Gin Lys Cys val Glu Ala 
2385 2390 2395 2400 

Gly Glu lie Pro Ser His Tyr Arg Gin Thr val lie val Pro Lys Glu 

2405 " 2410 2415 

Glu val Phe val Lys Thr Pro Gin Lys Pro Thr Lys Lys Pro Pro Arg 

2420 2425 2430 

Leu lie Ser Tyr Pro His Leu Glu Met Arg Cys val Glu Lys Met Tyr 
2435 2440 ~ 2445 
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Tyr Gly Gin val Ala Pro Asp val val Lys Ala val Met Gly Asp Ala 

2450 2455 2460 

Tyr Gly Phe val Asp Pro Arg Thr Arg val Lys Arg Leu Leu Ser Met 
2465 2470 " 2475 2480 

Trp Ser Pro Asp Ala Val Gly Ala Thr Cys Asp Thr val Cys Phe Asp 

2485 2490 2495 

Ser Thr lie Thr Pro Glu Asp lie Met val Glu Thr Asp lie Tyr Ser 

2500 2505 2510 

Ala Ala Lys Leu Ser Asp Gin His Arg Ala Gly lie His Thr lie Ala 

2515 2520 2525 

Arg Gin Leu Tyr Ala Gly Gly Pro Met lie Ala Tyr Asp Gly Arg Glu 

2530 2535 2540 

lie Gly Tyr Arg Arg Cys Arg Ser Ser Gly Val Tyr Thr Thr Ser Ser 
2545 " 2550 2555 2560 

Ser Asn Ser Leu Thr Cys Trp Leu Lys Val Asn Ala Ala Ala Glu Gin 

2565 2570 2575 

Ala Gly Met Lys Asn Pro Arg Phe Leu lie Cys Gly Asp Asp Cys Thr 

2580 2585 2590 

val lie Trp Lys Ser Ala Gly Ala Asp Ala Asp Lys Gin Ala Met Arg 

2595 2600 2605 

val Phe Ala Ser Trp Met Lys val Met Gly Ala Pro Gin Asp Cys val 

2610 2615 2620 

Pro Gin Pro Lys Tyr Ser Leu Glu Glu Leu Thr Ser Cys Ser Ser Asn 
2625 2630 2635 2640 

Val Thr Ser Gly lie Thr Lys Ser Gly Lys Pro Tyr Tyr Phe Leu Thr 

2645 2650 2655 

Arg Asp Pro Arg lie Pro Leu Gly Arg Cys Ser Ala Glu Gly Leu Gly 

2660 2665 2670 

Tyr Asn Pro Ser Ala Ala Trp lie Gly Tyr Leu lie His His Tyr Pro 

2675 2680 2685 

Cys Leu Trp val Ser Arg val Leu Ala val His Phe Met Glu Gin Met 

2690 2695 2700 

Leu Phe Glu Asp Lys Leu Pro Glu Thr Val Thr Phe Asp Trp Tyr Gly 
2705 2710 2715 2720 

Lys Asn Tyr Thr val Pro val Glu Asp Leu Pro Ser lie lie Ala Gly 

2725 2730 2735 

val His Gly lie Glu Ala Phe Ser Val Val Arg Tyr Thr Asn Ala Glu 

2740 2745 2750 

lie Leu Arg Val Ser Gin Ser Leu Thr Asp Met Thr Met Pro Pro Leu 

2755 2760 2765 

Arg Ala Trp Arg Lys Lys Ala Arg Ala Val Leu Ala Ser Ala Lys Arg 

2770 ~ 2775 2780 

Arg Gly Gly Ala His Ala Lys Leu Ala Arg Phe Leu Leu Trp His Ala 
2785 * 2790 2795 2800 

Thr Ser Arg Pro Leu Pro Asp Leu Asp Lys Thr Ser val Ala Arg Tyr 

2805 2810 2815 

Thr Thr Phe Asn Tyr Cys Asp Val Tyr Ser Pro Glu Gly Asp val Phe 

2820 2825 2830 

lie Thr Pro Gin Arg Arg Leu Gin Lys Phe Leu val Lys Tyr Leu Ala 

2835 ~ " 2840 - 2845 

val lie val Phe Ala Leu Gly Leu lie Ala val Gly Leu Ala lie Ser 
2850 2855 2860 

<210> 3 
<211> 9139 
<212> DNA 
<213> GBV-B virus 

<400> 3 

accacaaaca ctccagtttg ttacactccg ctaggaatgc tcctggagca ccccccctag 60 
cagggcgtgg gggatttccc ctgcccgtct gcagaagggt ggagccaacc accttagtat 120 
gtaggcggcg ggactcatga cgctcgcgtg atgacaagcg ccaagcttga cttggatggc 180 
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cctgatgggc gttcatgggt tcggtggtgg tggcgcttta ggcagcctcc acgcccacca 240 
cctcccagat agagcggcgg cactgtaggg aagaccgggg accggtcact accaaggacg 300 
cagacctctt tttgagtatc acgcctccgg aagtagttgg gcaagcccac ctatatgtgt 360 
tgggatggtt ggggttagcc atccataccg tactgcctga tagggtcctt gcgaggggat 420 
ctgggagtct cgtagaccgt agcacatgcc tgttatttct actcaaacaa gtcctgtacc 480 
tgcgcccaga acgcgcaaga acaagcagac gcaggcttca tatcctgtgt ccattaaaac 540 
atctgttgaa aggggacaac gagcaaagcg caaagtccag cgcgatgctc ggcctcgtaa 600 
ttacaaaatt gctggtatcc atgatggctt gcagacattg gctcaggctg ctttgccagc 660 
tcatggttgg ggacgccaag accctcgcca taagtctcgc aatcttggaa tccttctgga 720 
ttaccctttg gggtggattg gtgatgttac aactcacaca cctctagtag gcccgctggt 780 
ggcaggagcg gtcgttcgac cagtctgcca gatagtacgc ttgctggagg atggagtcaa 840 
ctgggctact ggttggttcg gtgtccacct ttttgtggta tgtctgctat ctttggcctg 900 
tccctgtagt ggggcgcggg tcactgaccc agacacaaat accacaatcc tgaccaattg 960 
ctgccagcgt aatcaggtta tctattgttc tccttccact tgcctacacg agcctggttg 1020 
tgtgatctgt gcggacgagt gctgggttcc cgccaatccg tacatctcac acccttccaa 1080 
ttggactggc acggactcct tcttggctga ccacattgat tttgttatgg gcgctcttgt 1140 
gacctgtgac gcccttgaca ttggtgagtt gtgtggtgcg tgtgtattag tcggtgactg 1200 
gcttgtcagg cactggctta ttcacataga cctcaatgaa actggtactt gttacctgga 1260 
agtgcccact ggaatagatc ctgggttcct agggtttatc gggtggatgg ccggcaaggt 1320 
cgaggctgtc atcttcttga ccaaactggc ttcacaagta ccatacgcta ttgcgactat 1380 
gtttagcagt gtacactacc tggcggttgg cgctctgatc tactatgcct ctcggggcaa 1440 
gtggtatcag ttgctcctag cgcttatgct ttacatagaa gcgacctctg gaaaccccat 1500 
cagggtgccc actggatgct caatagctga gttttgctcg cctttgatga taccatgtcc 1560 
ttgccactct tatttgagtg agaatgtgtc agaagtcatt tgttacagtc caaagtggac 1620 
caggcctatc actctagagt ataacaactc catatcttgg tacccctata caatccctgg 1680 
tgcgagggga tgtatggtta aattcaaaaa taacacatgg ggttgctgcc gtattcgcaa 1740 
tgtgccatcg tactgcacta tgggcactga tgcagtgtgg aacgacactc gcaacactta 1800 
cgaagcatgc ggtgtaacac catggctaac aaccgcatgg cacaacggct cagccctgaa 1860 
attggctata ttacaatacc ctgggtctaa agaaatgttt aaacctcata attggatgtc 1920 
aggccatttg tattttgagg gatcagatac ccctatagtt tacttttatg accctgtgaa 1980 
ttccactctc ctaccaccgg agaggtgggc taggttgccc ggtaccccac ctgtggtacg 2040 
tggttcttgg ttacaggttc cgcaagggtt ttacagtgat gtgaaagacc tagccacagg 2100 
attgatcacc aaagacaaag cctggaaaaa ttatcaggtc ttatattccg ccacgggtgc 2160 
tttgtctctt acgggagtta ccaccaaggc cgtggtgcta attctgttgg ggttgtgtgg 2220 
cagcaagtat cttattttag cctacctctg ttacttgtcc ctttgttttg ggcgcgcttc 2280 
tggttaccct ttgcgtcctg tgctcccatc ccagtcgtat ctccaagctg gctgggatgt 2340 
tttgtctaaa gctcaagtag ctccttttgc tttgattttc ttcatctgtt gctatctccg 2400 
ctgcaggcta cgttatgctg cccttttagg gtttgtgccc atggctgcgg gcttgcccct 2460 
aactttcttt gttgcagcag ctgctgccca accagattat gactggtggg tgcgactgct 2520 
agtggcaggg ttagttttgt gggccggccg taaccgtggt caccgcatag ctctgcttgt 2580 
aggtccttgg cctctggtag cgcttttaac cctcttgcat ttggttacgc ctgcttcagc 2640 
ttttgatacc gagataattg gagggctgac aataccacct gtagtagcat tagttgtcat 2700 
gtctcgtttt ggcttctttg ctcacttgtt acctcgctgt gctttagtta actcctatct 2760 
ttggcaacgt tgggagaatt ggttttggaa cgttacacta agaccggaga ggtttttcct 2820 
tgtgctggtt tgtttccccg gtgcgacata tgacgcgctg gtgactttct gtgtgtgtca 2880 
cgtagctctt ctatgtttaa catccagtgc agcatcgttc tttgggactg actctagggt 2940 
tagggcccat agaatgttgg tgcgtctcgg aaagtgtcat gcttggtatt ctcattatgt 3000 
tcttaagttt ttcctcttag tgtttggtga gaatggtgtg tttttctata agcacttgca 3060 
tggtgatgtc ttgcctaatg attttgcctc gaaactacca ttgcaagagc catttttccc 3120 
ttttgaaggc aaggcaaggg tctataggaa tgaaggaaga cgcttggcgt gtggggacac 3180 
ggttgatggt ttgcccgttg ttgcgcgtct cggcgacctt gttttcgcag ggttggctat 3240 
gccgccagat gggtgggcca ttaccgcacc ttttacgctg cagtgtctct ctgaacgtgg 3300 
cacgctgtca gcgatggcag tggtcatgac tggtatagac ccccgaactt ggactggaac 3360 
tatcttcaga ttaggatctc tggccactag ctacatggga tttgtttgtg acaacgtgtt 3420 
gtatactgct caccatggca gcaaggggcg ccggttggct catcccacag gctctataca 3480 
cccaataacc gttgacgcgg ctaatgacca ggacatctat caaccaccat gtggagctgg 3540 
gtcccttact cggtgctctt gcggggagac caaggggtat ctggtaacac gactggggtc 3600 
attggttgag gtcaacaaat ccgatgaccc ttattggtgt gtgtgcgggg cccttcccat 3660 
ggctgttgcc aagggttctt caggtgcccc gattctgtgc tcctccgggc atgttattgg 3720 
gatgttcacc gctgctagaa attctggcgg ttcagtcagt cagattaggg ttaggccgtt 3780 
ggtgtgtgct ggataccatc cccagtacac agcacatgcc actcttgata caaaacctac 3840 
tgtgcctaac gagtattcag tgcaaatttt aattgccccc actggcagcg gcaagtcaac 3900 
caaattacca ctttcttaca tgcaggagaa gtatgaggtc ttggtcctaa atcccagtgt 3960 
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ggctacaaca gcatcaatgc caaagtacat gcacgcgacg tacggcgtga atccaaattg 4020 
ctattttaat ggcaaatgta ccaacacagg ggcttcactt acgtacagca catatggcat 4080 
gtacctgacc ggagcatgtt cccggaacta tgatgtaatc atttgtgacg aatgccatgc 4140 
taccgatgca accaccgtgt tgggcattgg aaaggtccta accgaagctc catccaaaaa 4200 
tgttaggcta gtggttcttg ccacggctac cccccctgga gtaatcccta caccacatgc 4260 
caacataact gagattcaat taaccgatga aggcactatc ccctttcatg gaaaaaagat 4320 
taaggaggaa aatctgaaga aagggagaca ccttatcttt gaggctacca aaaaacactg 4380 
tgatgagctt gctaacgagt tagctcgaaa gggaataaca gctgtctctt actatagggg 4440 
atgtgacatc tcaaaaatcc ctgagggcga ctgtgtagta gttgccactg atgccttgtg 4500 
tacagggtac actggtgact ttgattccgt gtatgactgc agcctcatgg tagaaggcac 4560 
atgccatgtt gaccttgacc ctactttcac catgggtgtt cgtgtgtgcg gggtttcagc 4620 
aatagttaaa ggccagcgta ggggccgcac aggccgtggg agagctggca tatactacta 4680 
tgtagacggg agttgtaccc cttcgggtat ggttcctgaa tgcaacattg ttgaagcctt 4740 
cgacgcagcc aaggcatggt atggtttgtc atcaacagaa gctcaaacta ttctggacac 4800 
ctatcgcacc caacctgggt tacctgcgat aggagcaaat ttggacgagt gggctgatct 4860 
cttttctatg gtcaaccccg aaccttcatt tgtcaatact gcaaaaagaa ctgctgacaa 4920 
ttatgttttg ttgactgcag cccaactaca actgtgtcat cagtatggct atgctgctcc 4980 
caatgacgca ccacggtggc agggagcccg gcttgggaaa aaaccttgtg gggttctgtg 5040 
gcgcttggac ggcgctgacg cctgtcctgg cccagagccc agcgaggtga ccagatacca 5100 
aatgtgcttc actgaagtca atacttctgg gacagccgca ctcgctgttg gcgttggagt 5160 
ggctatggct tatctagcca ttgacacttt tggcgccact tgtgtgcggc gttgctggtc 5220 
tattacatca gtccctaccg gtgctactgt cgccccagtg gttgacgaag aagaaatcgt 5280 
ggaggagtgt gcatcattca ttcccttgga ggccatggtt gctgcaattg acaagctgaa 5340 
gagtacaatc accacaacta gtcctttcac attggaaacc gcccttgaaa aacttaacac 5400 
ctttcttggg cctcatgcag ctacaatcct tgctatcata gagtattgct gtggtttagt 5460 
cactttacct gacaatccct ttgcatcatg cgtgtttgct ttcattgcgg gtattactac 5520 
cccactacct cacaagatca aaatgttcct gtcattattt ggaggcgcaa ttgcgtccaa 5580 
gcttacagac gctagaggcg cactggcgtt catgatggcc ggggctgcgg gaacagctct 5640 
tggtacatgg acatcggtgg gttttgtctt tgacatgcta ggcggctatg ctgccgcctc 5700 
atccactgct tgcttgacat ttaaatgctt gatgggtgag tggcccacta tggatcagct 5760 
tgctggttta gtctactccg cgttcaatcc ggccgcagga gttgtgggcg tcttgtcagc 5820 
ttgtgcaatg tttgctttga caacagcagg gccagatcac tggcccaaca gacttcttac 5880 
tatgcttgct aggagcaaca ctgtatgtaa tgagtacttt attgccactc gtgacatccg 5940 
caggaagata ctgggcattc tggaggcatc taccccctgg agtgtcatat cagcttgcat 6000 
ccgttggctc cacaccccga cggaggatga ttgcggcctc attgcttggg gtctagagat 6060 
ttggcagtat gtgtgcaatt tctttgtgat ttgctttaat gtccttaaag ctggagttca 6120 
gagcatggtt aacattcctg gttgtccttt ctacagctgc cagaaggggt acaagggccc 6180 
ctggattgga tcaggtatgc tccaagcacg ctgtccatgc ggtgctgaac tcatcttttc 6240 
tgttgagaat ggttttgcaa aactttacaa aggacccaga acttgttcaa attactggag 6300 
aggggctgtt ccagtcaacg ctaggctgtg tgggtcggct agaccggacc caactgattg 6360 
gactagtctt gtcgtcaatt atggcgttag ggactactgt aaatatgaga aaatgggaga 6420 
tcacattttt gttacagcag tatcctctcc aaatgtctgt ttcacccagg tgcccccaac 6480 
cttgagagct gcagtggccg tggacggcgt acaggttcag tgttatctag gtgagcccaa 6540 
aactccttgg acgacatctg cttgctgtta cggtcctgac ggtaagggta aaactgttaa 6600 
gcttcccttc cgcgttgacg gtcacacacc tggtgtgcgc atgcaactta atttgcgtga 6660 
tgcacttgag acaaatgact gtaattccac aaacaacact cctagtgatg aagccgcagt 6720 
gtccgctctt gttttcaaac aggagttgcg gcgtacaaac caattgcttg aggcaatttc 6780 
agctggcgtt gacaccacca aactgccagc cccctccatc gaagaggtag tggtaagaaa 6840 
gcgccagttc cgggcaagaa ctggttcgct taccttgcct ccccctccga gatccgtccc 6900 
aggagtgtca tgtcctgaaa gcctgcaacg aagtgacccg ttagaaggtc cttcaaacct 6960 
ccctccttca ccacctgttc tacagttggc catgccgatg cccctgttgg gagcgggtga 7020 
gtgtaaccct ttcactgcaa ttggatgtgc aatgaccgaa acaggcggag gccctgatga 7080 
tttacccagt taccctccca aaaaggaggt ctctgaatgg tcagacgaaa gttggtcgac 7140 
ggctacaacc gcttccagct acgttactgg ccccccgtac cctaagatac ggggaaagga 7200 
ttccactcag tcagcccccg ccaaacggcc tacaaaaaag aagttgggaa agagtgagtt 7260 
ttcgtgcagc atgagctaca cctggaccga cgtgattagc ttcaaaactg cttctaaagt 7320 
tctgtctgca actcgggcca tcactagtgg tttcctcaaa caaagatcat tggtgtatgt 7380 
gactgagccg cgggatgcgg agcttagaaa acaaaaagtc actattaata gacaacctct 7440 
gttcccccca tcataccaca agcaagtgag attggctaag gaaaaagctt caaaagttgt 7500 
cggtgtcatg tgggactatg atgaagtagc agctcacacg ccctctaagt ctgctaagtc 7560 
ccacatcact ggccttcggg gcactgatgt tcgttctgga gcagcccgca aggctgttct 7620 
ggacttgcag aagtgtgtcg aggcaggtga gataccgagt cattatcggc aaactgtgat 7680 
agttccaaag gaggaggtct tcgtgaagac cccccagaaa ccaacaaaga aacccccaag 7740 
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gcttatctcg tacccccacc ttgaaatgag atgtgttgag aagatgtact acggtcaggt 7800 
tgctcctgac gtagttaaag ctgtcatggg agatgcgtac gggtttgtag atccacgtac 7860 
ccgtgtcaag cgtctgttgt cgatgtggtc acccgatgca gtcggagcca catgcgatac 7920 
agtgtgtttt gacagtacca tcacacccga ggatatcatg gtggagacag acatctactc 7980 
agcagctaaa ctcagtgacc aacaccgagc tggcattcac accattgcga ggcagttata 8040 
cgctggagga ccgatgatcg cttatgatgg ccgagagatc ggatatcgta ggtgtaggtc 8100 
ttccggcgtc tatactacct caagttccaa cagtttgacc tgctggctga aggtaaatgc 8160 
tgcagccgaa caggctggca tgaagaaccc tcgcttcctt atttgcggcg atgattgcac 8220 
cgtaatttgg aagagcgccg gagcagatgc agacaaacaa gcaatgcgtg tctttgctag 8280 
ctggatgaag gtgatgggtg caccacaaga ttgtgtgcct caacccaaat acagtttgga 8340 
agaattaaca tcatgctcat caaatgttac ctctggaatt accaaaagtg gcaagcctta 8400 
ctactttctt acaagagatc ctcgtatccc ccttggcagg tgctctgccg agggtctggg 8460 
atacaacccc agtgctgcgt ggattgggta tctaatacat cactacccat gtttgtgggt 8520 
tagccgtgtg ttggctgtcc atttcatgga gcagatgctc tttgaggaca aacttcccga 8580 
gactgtgacc tttgactggt atgggaaaaa ttatacggtg cctgtagaag atctgcccag 8640 
catcattgct ggtgtgcacg gtattgaggc tttctcggtg gtgcgctaca ccaacgctga 8700 
gatcctcaga gtttcccaat cactaacaga catgaccatg ccccccctgc gagcctggcg 8760 
aaagaaagcc agggcggtcc tcgccagcgc caagaggcgt ggcggagcac acgcaaaatt 8820 
ggctcgcttc cttctctggc atgctacatc tagacctcta ccagatttgg ataagacgag 8880 
cgtggctcgg tacaccactt tcaattattg tgatgtttac tccccggagg gggatgtgtt 8940 
tattacacca cagagaagat tgcagaagtt ccttgtgaag tatttggctg tcattgtttt 9000 
tgccctaggg ctcattgctg ttggattagc catcagctga acccccaaat tcaaaattaa 9060 
ctaacagttt tttttttttt tttttttttt agggcagcgg caacagggga gaccccgggc 9120 
ttaacgaccc cgcgatgtg 9139 

<210> 4 
<211> 9711 
<212> DNA 

<213> Hepatitis C virus 
<400> 4 

acccgcccct aataggggcg acactccgcc atgaatcact cccctgtgag gaactactgt 60 
cttcacgcag aaagcgtcta gccatggcgt tagtatgagt gtcgtacagc ctccaggccc 120 
ccccctcccg ggagagccat agtggtctgc ggaaccggtg agtacaccgg aattgccggg 180 
aagactgggt cctttcttgg ataaacccac tctatgcccg gccatttggg cgtgcccccg 240 
caagactgct agccgagtag cgttgggttg cgaaaggcct tgtggtactg cctgataggg 300 
tgcttgcgag tgccccggga ggtctcgtag accgtgcacc atgagcacaa atcctaaacc 360 
tcaaagaaaa accaaaagaa acaccaaccg tcgcccacaa gacgttaagt ttccgggcgg 420 
cggccagatc gttggcggag tatacttgtt gccgcgcagg ggccccaggt tgggtgtgcg 480 
cgcgacaagg aagacttcgg agcggtccca gccacgtgga aggcgccagc ccatccctaa 540 
agatcggcgc tccactggca aatcctgggg aaaaccagga tacccctggc ccctatacgg 600 
gaatgaggga ctcggctggg caggatggct cctgtccccc cgaggttccc gtccctcttg 660 
gggccccaat gacccccggc ataggtcgcg caacgtgggt aaggtcatcg ataccctaac 720 
gtgcggcttt gccgacctca tggggtacat ccctgtcgtg ggcgccccgc tcggcggcgt 780 
cgccagagct ctcgcgcatg gcgtgagagt cctggaggac ggggttaatt ttgcaacagg 840 
gaacttaccc ggttgctcct tttctatctt cttgctggcc ctgctgtcct gcatcaccac 900 
cccggtctcc gctgccgaag tgaagaacat cagtaccggc tacatggtga ctaacgactg 960 
caccaatgac agcattacct ggcagctcca ggctgctgtc ctccacgtcc ccgggtgcgt 1020 
cccgtgcgag aaagtgggga atgcatctca gtgctggata ccggtctcac cgaatgtggc 1080 
cgtgcagcgg cccggcgccc tcacgcaggg cttgcggacg cacatcgaca tggttgtgat 1140 
gtccgccacg ctctgctctg ccctctacgt gggggacctc tgcggtgggg tgatgctcgc 1200 
agcccaaatg ttcattgtct cgccgcagca ccactggttt gtccaagact gcaattgctc 1260 
catctaccct ggtaccatca ctggacaccg catggcatgg gacatgatga tgaactggtc 1320 
gcccacggct accatgatct tggcgtacgc gatgcgtgtc cccgaggtca ttatagacat 1380 
cattagcggg gctcattggg gcgtcatgtt cggcttggcc tacttctcta tgcagggagc 1440 
gtgggcgaaa gtcgttgtca tccttctgtt ggccgccggg gtggacgcgc gcacccatac 1500 
tgttgggggt tctgccgcgc agaccaccgg gcgcctcacc agcttatttg acatgggccc 1560 
caggcagaaa atccagctcg ttaacaccaa tggcagctgg cacatcaacc gcaccgccct 1620 
gaactgcaat gactccttgc acaccggctt tatcgcgtct ctgttctaca cccacagctt 1680 
caactcgtca ggatgtcccg aacgcatgtc cgcctgccgc agtatcgagg ccttccgggt 1740 
gggatggggc gccttgcaat atgaggataa tgtcaccaat ccagaggata tgagacccta 1800 
ttgctggcac tacccaccaa ggcagtgtgg cgtggtctcc gcgaagactg tgtgtggccc 1860 
agtgtactgt ttcaccccca gcccagtggt agtgggcacg accgacaggc ttggagcgcc 1920 
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cacttacacg tggggggaga atgagacaga tgtcttccta ttgaacagca ctcgaccacc 1980 
gctggggtca tggttcggct gcacgtggat gaactcttct ggctacacca agacttgcgg 2040 
cgcaccaccc tgccgtacta gagctgactt caacgccagc acggacctgt tgtgccccac 2100 
ggactgtttt aggaagcatc ctgataccac ttacctcaaa tgcggctctg ggccctggct 2160 
cacgccaagg tgcctgatcg actaccccta caggctctgg cattacccct gcacagttaa 2220 
ctataccatc ttcaaaataa ggatgtatgt gggaggggtt gagcacaggc tcacggctgc 2280 
atgcaatttc actcgtgggg atcgttgcaa cttggaggac agagacagaa gtcaactgtc 2340 
tcctttgttg cactccacca cggaatgggc cattttacct tgctcttact cggacctgcc 2400 
cgccttgtcg actggtcttc tccacctcca ccaaaacatc gtggacgtac aattcatgta 2460 
tggcctatca cctgccctca caaaatacat cgtccgatgg gagtgggtaa tactcttatt 2520 
cctgctctta gcggacgcca gggtttgcgc ctgcttatgg atgctcatct tgttgggcca 2580 
ggccgaagca gcactagaga agctggtcat cttgcacgct gcgagcgcag ctagctgcaa 2640 
tggcttccta tattttgtca tctttttcgt ggctgcttgg tacatcaagg gtcgggtagt 2700 
ccccttagct acctattccc tcactggcct gtggtccttt agcctactgc tcctagcatt 2760 
gccccaacag gcttatgctt atgacgcatc tgtgcatggc cagataggag cggctctgct 2820 
ggtaatgatc actctcttta ctctcacccc cgggtataag acccttctca gccggttttt 2880 
gtggtggttg tgctatcttc tgaccctggg ggaagctatg gtccaggagt gggcaccacc 2940 
tatgcaggtg cgcggtggcc gtgatggcat catatgggcc gtcgccatat tctacccagg 3000 
tgtggtgttt gacataacca agtggctctt ggcggtgctt gggcctgctt acctcctaaa 3060 
aggtgctttg acgcgcgtgc cgtacttcgt cagggctcac gctctactga ggatgtgcac 3120 
catggcaagg catctcgcgg ggggcaggta cgtccagatg gcgctactag cccttggcag 3180 
gtggactggc acttacatct atgaccacct cacccctatg tcggattggg ctgctagtgg 3240 
cctgcgggac ctggcggtcg ccgttgagcc tatcatcttc agtccgatgg agaagaaagt 3300 
cattgtctgg ggagcggaga cagctgcttg tggggacatt ttacacggac ttcccgtgtc 3360 
cgcccgactt ggtcgggagg tcctccttgg cccagctgat ggctatacct ccaaggggtg 3420 
gagtcttctc gcccccatca ctgcttacgc ccagcagaca cgtggccttt tgggcaccat 3480 
agtggtgagc atgacggggc gcgacaagac agaacaggct ggggaaattc aggtcctgtc 3540 
cacagtcact cagtccttcc tcggaacatc catctcgggg gttttgtgga ctgtctacca 3600 
tggagctggc aacaagactc tggccggctc acggggtccg gtcacgcaga tgtactccag 3660 
tgctgagggg gacttagtag ggtggcccag cccccctggg actaaatctt tggagccgtg 3720 
cacgtgtgga gcggtcgacc tgtacctggt cacgcggaac gctgatgtca tcccggctcg 3780 
aagacgcggg gacaaacggg gagcgctact ctccccgaga cctctttcca ccttgaaggg 3840 
gtcctcagga ggcccggtgc tatgccccag gggccacgct gtcggagtct tccgggcagc 3900 
tgtgtgctct cggggcgtgg ctaagtccat agatttcatc cccgttgaga cactcgacat 3960 
cgtcacgcgg tcccccacct ttagtgacaa cagcacacca cctgctgtgc cccagaccta 4020 
tcaggtcggg tacttgcatg ccccgactgg cagtggaaag agcaccaaag ttcctgtcgc 4080 
atatgctgct caggggtata aagtgctagt gcttaatccc tcagtggctg ccaccctggg 4140 
gtttggggcg tacttgtcta aggcacatgg catcaatccc aacattagga ctggagtcag 4200 
gactgtgacg accggggcgc ccatcacgta ctccacatat ggcaaattcc tcgccgatgg 4260 
gggctgtgcg ggcggcgcct acgacatcat catatgtgat gaatgccatg ccgtggactc 4320 
taccaccatc cttggcatcg gaacagtcct tgatcaagca gagacagctg gggtcagact 4380 
aactgtgctg gctacagcta cgccccctgg gtcagtgaca accccccacc ccaacataga 4440 
ggaggtggcc cttgggcagg agggcgagat ccccttctat gggagggcga ttcccctgtc 4500 
ttacatcaag ggaggaagac atctgatctt ctgccattca aagaaaaagt gtgacgagct 4560 
cgcggcggcc cttcggggta tgggcttgaa ctcagtggca tactacagag ggttggacgt 4620 
ctccgtaata ccaactcagg gagacgtagt ggtcgtcgcc accgacgccc tcatgacagg 4680 
gtatactggg gactttgact ccgtgatcga ctgcaacgta gcggtcactc aagttgtaga 4740 
cttcagttta gaccccacat tcaccataac cacacagatt gtccctcaag acgctgtctc 4800 
acgtagccag cgccggggtc gcacgggtag gggaagactg ggcatttata ggtatgtttc 4860 
cactggtgag cgagcctcag gaatgtttga cagtgtagtg ctctgtgagt gctacgacgc 4920 
aggggccgca tggtatgagc tcacaccatc ggagaccacc gtcaggctca gggcgtattt 4980 
caacacgccc ggtttgcctg tgtgccaaga ccatcttgag ttttgggagg cagttttcac 5040 
cggcctcaca cacatagatg cccacttcct ttcccaaaca aagcaatcgg gggaaaattt 5100 
cgcatactta acagcctacc aggctacagt gtgcgctagg gccaaagccc cccccccgtc 5160 
ctgggacgtc atgtggaagt gtttgactcg actcaagccc acactcgtgg gccccacacc 5220 
tctcctgtac cgcttgggct ctgttaccaa cgaggtcacc ctcacacatc ccgtgacgaa 5280 
atacatcgcc acctgcatgc aagccgacct tgaggtcatg accagcacat gggtcttggc 5340 
agggggagtc ttggcggccg tcgccgcgta ttgcctggcg accgggtgtg tttgcatcat 5400 
cggccgcttg cacattaacc agcgagccgt cgttgcgccg gacaaggagg tcctctatga 5460 
ggcttttgat gagatggagg aatgtgcctc tagggcggct ctcattgaag aggggcagcg 5520 
gatagccgag atgctgaagt ccaagatcca aggcttattg cagcaagctt ccaaacaagc 5580 
tcaagacata caacccactg tgcaggcttc atggcccaag gtagaacaat tctgggccaa 5640 
acacatgtgg aacttcatta gcggcatcca atacctcgca ggactatcaa cactgccagg 5700 

Page 13 



2540109_1.TXT 

gaaccctgca gtagcttcca tgatggcgtt cagtgccgcc ctcaccagtc cgctgtcaac 5760 
aagcaccact atccttctca acattttggg gggctggcta gcatcccaaa ttgcaccacc 5820 
cgcgggggcc actggcttcg ttgtcagtgg cctagtggga gctgccgtag gcagtatagg 5880 
cttaggtaag gtgctagtgg acatcctggc agggtatggt gcgggcattt cgggggctct 5940 
cgtcgcattc aagatcatgt ctggcgagaa gccctccatg gaggatgtcg tcaacttgct 6000 
gcctggaatt ctgtctccgg gtgccttggt agtgggagtc atctgcgcgg ccattctgcg 6060 
ccgacacgtg ggaccggggg aaggcgccgt ccaatggatg aatagactca ttgcctttgc 6120 
ttccagagga aatcacgtcg cccccaccca ctacgtgacg gagtcggatg cgtcgcagcg 6180 
tgtgacccaa ctacttggct cccttaccat aaccagcctg ctcagaagac tccacaactg 6240 
gattactgag gactgcccca tcccatgcgg cggctcgtgg ctccgcgatg tgtgggactg 6300 
ggtttgcacc atcctaacag actttaaaaa ttggctgacc tccaaattat tcccaaagat 6360 
gcccggcctc ccctttgtct cctgtcaaaa ggggtacaag ggcgtgtggg ccggcactgg 6420 
catcatgacc acacggtgtc cttgcggcgc caatatctct ggcaatgtcc gcttgggctc 6480 
catgagaatc acggggccta agacctgcat gaatatctgg caggggacct ttcctatcaa 6540 
ttgttacacg gagggccagt gcgtgccgaa acccgcgcca aactttaagg tcgccatctg 6600 
gagggtggcg gcctcagagt acgcggaggt gacgcagcac gggtcatacc actacataac 6660 
aggactcacc actgataact tgaaagtccc ctgccaacta ccctctcccg agttcttttc 6720 
ctgggtggac ggagtgcaga tccataggtt tgcccccaca ccgaagccgt ttttccggga 6780 
tgaggtctcg ttctgcgttg ggcttaattc atttgtcgtc gggtcccagc ttccttgcga 6840 
ccctgaaccc gacacagacg tattgatgtc catgctaaca gatccatctc atatcacggc 6900 
ggagactgca gcgcggcgtt tagcgcgggg gtcaccccca tccgaggcaa gctcctcggc 6960 
gagccagcta tcggcaccat cgctgcgagc cacctgcacc acccacggca aagcctatga 7020 
tgtggacatg gtggatgcta acctgttcat ggggggcgat gtgactcgga tagagtctgg 7080 
gtccaaagtg gtcgttctgg actctctcga cccaatggtc gaagaaagga gcgaccttga 7140 
gccttcgata ccatcagaat acatgctccc caagaagagg ttcccaccag ctttaccggc 7200 
ctgggcacgg cctgattaca acccaccgct tgtggaatcg tggaaaaggc cagattacca 7260 
accggccact gttgcgggct gtgctctccc tcctcctagg aaaaccccga cgcctccccc 7320 
aaggaggcgc cggacagtgg gcctaagtga ggactccata ggagatgccc ttcaacagct 7380 
ggccattaag tcctttggcc agcccccccc aagcggcgat tcaggccttt ccacgggggc 7440 
gggcgctgcc gattccggca gtcagacgcc tcctgatgag ttggcccttt cggagacagg 7500 
ttccatctct tccatgcccc ccctcgaggg ggagcttgga gatccagacc tggagcctga 7560 
gcaggtagag ccccaacccc ccccccaggg gggggtggca gctcccggct cggactcggg 7620 
gtcctggtct acttgctccg aggaggacga ctccgtcgtg tgctgctcca tgtcatactc 7680 
ctggaccggg gctctaataa ctccttgtag tcccgaagag gagaagttac cgattaaccc 7740 
cttgagcaac tccctgttgc gatatcacaa caaggtgtac tgtaccacaa caaagagcgc 7800 
ctcactaagg gctaaaaagg taacttttga taggatgcaa gtgctcgact cctactacga 7860 
ctcagtctta aaggacatta agctagcggc ctccaaggtc accgcaaggc tcctcaccat 7920 
ggaggaggct tgccagttaa ccccacccca ttctgcaaga tctaaatatg ggtttggggc 7980 
taaggaggtc cgcagcttgt ccgggagggc cgttaaccac atcaagtccg tgtggaagga 8040 
cctcctggag gactcagaaa caccaattcc cacaaccatt atggccaaaa atgaggtgtt 8100 
ctgcgtggac cccaccaagg ggggcaagaa agcagctcgc cttatcgttt accctgacct 8160 
cggcgtcagg gtctgcgaga agatggccct ttatgacatt acacaaaaac ttcctcaggc 8220 
ggtgatgggg gcttcttatg gattccagta ttcccccgct cagcgggtag agtttctctt 8280 
gaaagcatgg gcggaaaaga aggaccctat gggtttttcg tatgataccc gatgctttga 8340 
ctcaaccgtc actgagagag acatcaggac tgaggagtcc atatatcggg cctgctcctt 8400 
gcccgaggag gcccacactg ccatacactc gctaactgag agactttacg tgggagggcc 8460 
tatgttcaac agcaagggcc aaacctgcgg gtacaggcgt tgccgcgcca gcggggtgct 8520 
caccactagc atggggaaca ccatcacatg ctacgtgaaa gccttagcgg cttgtaaagc 8580 
tgcagggata atcgcgccca caatgctggt atgcggcgat gacttggttg tcatctcaga 8640 
aagccagggg accgaggagg acgagcggaa cctgagagcc ttcacggagg ctatgaccag 8700 
gtattctgcc cctcctggtg acccccccag accggagtat gatctggagc tgataacatc 8760 
ttgctcctca aatgtgtctg tggcgctggg cccacaaggc cgccgcagat actacctgac 8820 
cagagaccct accactccaa tcgcccgggc tgcctgggaa acagttagac actcccctgt 8880 
caattcatgg ctgggaaaca tcatccagta cgccccgacc atatgggctc gcatggtcct 8940 
gatgacacac ttcttctcca ttctcatggc tcaagacacg ctggaccaga acctcaactt 9000 
tgagatgtac ggagcggtgt actccgtgag tcccttggac ctcccagcta taattgaaag 9060 
gttacatggg cttgacgctt tttctctgca cacatacact ccccacgaac tgacacgggt 9120 
ggcttcagcc ctcagaaaac ttggggcgcc acccctcaga gcgtggaaga gccgggcacg 9180 
tgcagtcagg gcgtccctca tctcccgtgg ggggagagcg gccgtttgcg gtcgatatct 9240 
cttcaattgg gcggtgaaga ccaagctcaa actcactcca ttgccggaag cgcgcctcct 9300 
ggatttatcc agctggttca ccgtcggcgc cggcgggggc gacatttatc acagcgtgtc 9360 
gcgtgcccga ccccgcttat tgctctttgg cctactccta ctttttgtag gggtaggcct 9420 
tttcctactc cccgctcggt agagcggcac acattagcta cactccatag ctaactgtcc 9480 
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cttttttttt tttttttttt tttttttttt tttttttttt tttttttttt tttttttttt 9540 
tttttttttt tttttttttt tttttctttt tttctctttt ccttctttct taccttattt 9600 
tactttcttt cctggtggct ccatcttagc cctagtcacg gctagctgtg aaaggtccgt 9660 
gagccgcatg actgcagaga gtgccgtaac tggtctctct gcagatcatg t 9711 

<210> 5 
<211> 3033 
<212> PRT 

<213> Hepatitis C virus 
<400> 5 

Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 

1 5 10 15 

Arg Arg Pro Gin Asp val Lys Phe Pro Gly Gly Gly Gin lie Val Gly 

20 25 30 

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly val Arg Ala 

35 40 45 

Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 

50 55 60 

lie Pro Lys Asp Arg Arg Ser Thr Gly Lys Ser Trp Gly Lys Pro Gly 
65 70 75 80 

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp 

85 * 90 95 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Asn Asp Pro 

100 105 110 

Arg His Arg Ser Arg Asn val Gly Lys Val lie Asp Thr Leu Thr Cys 

115 " 120 125 

Gly Phe Ala Asp Leu Met Gly Tyr lie Pro val val Gly Ala Pro Leu 

130 135 140 

Gly Gly val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 
145 150 155 160 

Gly Val Asn Phe Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser lie 

165 170 175 

Phe Leu Leu Ala Leu Leu Ser Cys lie Thr Thr Pro val Ser Ala Ala 

180 185 190 

Glu val Lys Asn lie Ser Thr Gly Tyr Met val Thr Asn Asp Cys Thr 

195 200 205 

Asn Asp Ser lie Thr Trp Gin Leu Gin Ala Ala val Leu His val Pro 

210 215 220 

Gly Cys val Pro Cys Glu Lys Val Gly Asn Ala Ser Gin Cys Trp lie 
225 230 235 240 

Pro val Ser Pro Asn val Ala val Gin Arg Pro Gly Ala Leu Thr Gin 

245 250 ~ 255 

Gly Leu Arg Thr His lie Asp Met val val Met Ser Ala Thr Leu Cys 

260 * 265 270 

Ser Ala Leu Tyr Val Gly Asp Leu Cys Gly Gly val Met Leu Ala Ala 

275 280 285 

Gin Met Phe lie val Ser Pro Gin His His Trp Phe Val Gin Asp Cys 

290 295 300 

Asn Cys Ser lie Tyr Pro Gly Thr lie Thr Gly His Arg Met Ala Trp 
305 310 315 ~ 320 

Asp Met Met Met Asn Trp Ser Pro Thr Ala Thr Met lie Leu Ala Tyr 

325 330 335 

Ala Met Arg val Pro Glu Val lie lie Asp lie lie Ser Gly Ala His 

340 345 350 

Trp Gly val Met Phe Gly Leu Ala Tyr Phe Ser Met Gin Gly Ala Trp 

355 360 365 

Ala Lys val val Val lie Leu Leu Leu Ala Ala Gly val Asp Ala Arg 

370 375 380 

Thr His Thr Val Gly Gly Ser Ala Ala Gin Thr Thr Gly Arg Leu Thr 
385 390 395 400 

Ser Leu Phe Asp Met Gly Pro Arg Gin Lys lie Gin Leu Val Asn Thr 
405 "* 410 415 
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Asn Gly Ser Trp His lie Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser 

420 425 430 

Leu His Thr Gly Phe lie Ala Ser Leu Phe Tyr Thr His Ser Phe Asn 

435 440 445 

Ser Ser Gly Cys Pro Glu Arg Met Ser Ala Cys Arg Ser lie Glu Ala 

450 455 460 

Phe Arg Val Gly Trp Gly Ala Leu Gin Tyr Glu Asp Asn val Thr Asn 
465 470 475 480 

Pro Glu Asp Met Arg Pro Tyr Cys Trp His Tyr Pro Pro Arg Gin Cys 

485 490 495 

Gly val val Ser Ala Lys Thr val Cys Gly Pro val Tyr Cys Phe Thr 

500 505 510 

Pro Ser Pro val val val Gly Thr Thr Asp Arg Leu Gly Ala Pro Thr 

515 520 525 

Tyr Thr Trp Gly Glu Asn Glu Thr Asp Val Phe Leu Leu Asn Ser Thr 

530 535 540 

Arg Pro Pro Leu Gly Ser Trp Phe Gly Cys Thr Trp Met Asn Ser Ser 
545 550 555 560 

Gly Tyr Thr Lys Thr Cys Gly Ala Pro Pro Cys Arg Thr Arg Ala Asp 

565 570 575 

Phe Asn Ala Ser Thr Asp Leu Leu Cys Pro Thr Asp Cys Phe Arg Lys 

580 585 590 

His Pro Asp Thr Thr Tyr Leu Lys Cys Gly Ser Gly Pro Trp Leu Thr 

595 600 605 

Pro Arg Cys Leu lie Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys 

610 615 620 

Thr Val Asn Tyr Thr lie Phe Lys lie Arg Met Tyr val Gly Gly val 
625 630 635 640 

Glu His Arg Leu Thr Ala Ala Cys Asn Phe Thr Arg Gly Asp Arg Cys 

645 650 655 

Asn Leu Glu Asp Arg Asp Arg Ser Gin Leu Ser Pro Leu Leu His Ser 

660 665 670 

Thr Thr Glu Trp Ala lie Leu Pro Cys Ser Tyr Ser Asp Leu Pro Ala 

675 680 685 

Leu Ser Thr Gly Leu Leu His Leu His Gin Asn lie val Asp Val Gin 

690 695 700 

Phe Met Tyr Gly Leu Ser Pro Ala Leu Thr Lys Tyr lie val Arg Trp 
705 710 715 720 

Glu Trp val lie Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg val Cys 

725 730 735 

Ala Cys Leu Trp Met Leu lie Leu Leu Gly Gin Ala Glu Ala Ala Leu 

740 745 750 

Glu Lys Leu val lie Leu His Ala Ala Ser Ala Ala Ser Cys Asn Gly 

755 760 765 

Phe Leu Tyr Phe Val lie Phe Phe Val Ala Ala Trp Tyr lie Lys Gly 

770 775 780 

Arg val val Pro Leu Ala Thr Tyr Ser Leu Thr Gly Leu Trp Ser Phe 
785 790 795 800 

Ser Leu Leu Leu Leu Ala Leu Pro Gin Gin Ala Tyr Ala Tyr Asp Ala 

805 810 815 

Ser val His Gly Gin lie Gly Ala Ala Leu Leu val Met lie Thr Leu 

820 825 830 

Phe Thr Leu Thr Pro Gly Tyr Lys Thr Leu Leu Ser Arg Phe Leu Trp 

835 840 845 

Trp Leu Cys Tyr Leu Leu Thr Leu Gly Glu Ala Met val Gin Glu Trp 

850 * 855 860 

Ala Pro Pro Met Gin val Arg Gly Gly Arg Asp Gly lie lie Trp Ala 
865 870 * " 875 880 

val Ala lie Phe Tyr Pro Gly val val Phe Asp lie Thr Lys Trp Leu 

885 890 895 

Leu Ala val Leu Gly Pro Ala Tyr Leu Leu Lys Gly Ala Leu Thr Arg 

900 905 910 

val Pro Tyr Phe val Arg Ala His Ala Leu Leu Arg Met Cys Thr Met 
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915 920 925 

Ala Arg His Leu Ala Gly Gly Arg Tyr val Gin Met Ala Leu Leu Ala 

930 935 940 

Leu Gly Arg Trp Thr Gly Thr Tyr lie Tyr Asp His Leu Thr Pro Met 
945 " 950 955 960 

Ser Asp Trp Ala Ala Ser Gly Leu Arg Asp Leu Ala val Ala val Glu 

965 970 975 

Pro lie lie Phe Ser Pro Met Glu Lys Lys val lie val Trp Gly Ala 

980 985 990 

Glu Thr Ala Ala Cys Gly Asp lie Leu His Gly Leu Pro val Ser Ala 

995 1000 1005 

Arg Leu Gly Arg Glu val Leu Leu Gly Pro Ala Asp Gly Tyr Thr Ser 

1010 1015 1020 

Lys Gly Trp Ser Leu Leu Ala Pro lie Thr Ala Tyr Ala Gin Gin Thr 
1025 1030 1035 1040 

Arg Gly Leu Leu Gly Thr lie val Val Ser Met Thr Gly Arg Asp Lys 

1045 1050 1055 

Thr Glu Gin Ala Gly Glu lie Gin Val Leu Ser Thr val Thr Gin Ser 

1060 1065 1070 

Phe Leu Gly Thr Ser lie Ser Gly Val Leu Trp Thr val Tyr His Gly 

1075 1080 1085 

Ala Gly Asn Lys Thr Leu Ala Gly Ser Arg Gly Pro val Thr Gin Met 

1090 1095 ~ 1100 

Tyr Ser Ser Ala Glu Gly Asp Leu val Gly Trp Pro Ser Pro Pro Gly 
1105 1110 1115 1120 

Thr Lys Ser Leu Glu Pro Cys Thr Cys Gly Ala Val Asp Leu Tyr Leu 

1125 1130 1135 

val Thr Arg Asn Ala Asp Val lie Pro Ala Arg Arg Arg Gly Asp Lys 

1140 1145 " ~ 1150 

Arg Gly Ala Leu Leu Ser Pro Arg Pro Leu Ser Thr Leu Lys Gly Ser 

1155 1160 1165 

Ser Gly Gly Pro val Leu Cys Pro Arg Gly His Ala Val Gly val Phe 

1170 1175 " 1180 

Arg Ala Ala val Cys Ser Arg Gly val Ala Lys Ser lie Asp Phe lie 
1185 1190 1195 1200 

Pro val Glu Thr Leu Asp lie val Thr Arg Ser Pro Thr Phe Ser Asp 

1205 1210 1215 

Asn Ser Thr Pro Pro Ala val Pro Gin Thr Tyr Gin val Gly Tyr Leu 

1220 1225 1230 

His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys val Pro val Ala Tyr 

1235 1240 1245 

Ala Ala Gin Gly Tyr Lys Val Leu val Leu Asn Pro Ser val Ala Ala 

1250 ' 1255 1260 

Thr Leu Gly Phe Gly Ala Tyr Leu Ser Lys Ala His Gly lie Asn Pro 
1265 1270 1275 1280 

Asn lie Arg Thr Gly val Arg Thr Val Thr Thr Gly Ala Pro lie Thr 

1285 1290 1295 

Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys Ala Gly Gly 

1300 1305 1310 

Ala Tyr Asp lie lie lie Cys Asp Glu Cys His Ala val Asp Ser Thr 

1315 1320 " 1325 

Thr lie Leu Gly lie Gly Thr Val Leu Asp Gin Ala Glu Thr Ala Gly 

1330 ' 1335 ' 1340 

val Arg Leu Thr val Leu Ala Thr Ala Thr Pro Pro Gly Ser val Thr 
1345 1350 1355 1360 

Thr Pro His Pro Asn lie Glu Glu val Ala Leu Gly Gin Glu Gly Glu 

1365 1370 1375 

lie Pro Phe Tyr Gly Arg Ala lie Pro Leu Ser Tyr lie Lys Gly Gly 

1380 1385 1390 

Arg His Leu lie Phe Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala 

1395 1400 1405 

Ala Ala Leu Arg Gly Met Gly Leu Asn Ser val Ala Tyr Tyr Arg Gly 
1410 1415 1420 
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Leu Asp val Ser val lie Pro Thr Gin Gly Asp val val val val Ala 
1425 1430 1435 1440 

Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp Phe Asp Ser val lie 

1445 1450 1455 

Asp Cys Asn val Ala val Thr Gin Val val Asp Phe Ser Leu Asp Pro 

1460 1465 1470 

Thr Phe Thr lie Thr Thr Gin lie Val Pro Gin Asp Ala val Ser Arg 

1475 1480 1485 

Ser Gin Arg Arg Gly Arg Thr Gly Arg Gly Arg Leu Gly lie Tyr Arg 

1490 1495 1500 

Tyr val Ser Thr Gly Glu Arg Ala Ser Gly Met Phe Asp Ser Val val 
1505 1510 1515 1520 

Leu Cys Glu Cys Tyr Asp Ala Gly Ala Ala Trp Tyr Glu Leu Thr Pro 

1525 1530 1535 

Ser Glu Thr Thr Val Arg Leu Arg Ala Tyr Phe Asn Thr Pro Gly Leu 

1540 ~ 1545 1550 

Pro val Cys Gin Asp His Leu Glu Phe Trp Glu Ala Val Phe Thr Gly 

1555 1560 1565 

Leu Thr His lie Asp Ala His Phe Leu Ser Gin Thr Lys Gin Ser Gly 

1570 1575 1580 

Glu Asn Phe Ala Tyr Leu Thr Ala Tyr Gin Ala Thr Val Cys Ala Arg 
1585 1590 1595 1600 

Ala Lys Ala Pro Pro Pro Ser Trp Asp val Met Trp Lys Cys Leu Thr 

1605 1610 1615 

Arg Leu Lys Pro Thr Leu val Gly Pro Thr Pro Leu Leu Tyr Arg Leu 

1620 1625 1630 

Gly Ser Val Thr Asn Glu val Thr Leu Thr His Pro val Thr Lys Tyr 

1635 1640 1645 

lie Ala Thr Cys Met Gin Ala Asp Leu Glu Val Met Thr Ser Thr Trp 

1650 1655 1660 

val Leu Ala Gly Gly val Leu Ala Ala val Ala Ala Tyr Cys Leu Ala 
1665 1670 1675 1680 

Thr Gly Cys val Cys lie lie Gly Arg Leu His lie Asn Gin Arg Ala 

1685 1690 1695 

val val Ala Pro Asp Lys Glu val Leu Tyr Glu Ala Phe Asp Glu Met 

1700 1705 1710 

Glu Glu Cys Ala Ser Arg Ala Ala Leu lie Glu Glu Gly Gin Arg lie 

1715 ^ 1720 1725 

Ala Glu Met Leu Lys Ser Lys lie Gin Gly Leu Leu Gin Gin Ala Ser 

1730 1735 1740 

Lys Gin Ala Gin Asp lie Gin Pro Thr val Gin Ala Ser Trp Pro Lys 
1745 1750 1755 1760 

val Glu Gin Phe Trp Ala Lys His Met Trp Asn Phe lie Ser Gly lie 

1765 1770 1775 

Gin Tyr Leu Ala Gly Leu Ser Thr Leu Pro Gly Asn Pro Ala Val Ala 

1780 1785 1790 

Ser Met Met Ala Phe Ser Ala Ala Leu Thr Ser Pro Leu Ser Thr Ser 

1795 1800 1805 

Thr Thr lie Leu Leu Asn lie Leu Gly Gly Trp Leu Ala Ser Gin lie 

1810 1815 1820 

Ala Pro Pro Ala Gly Ala Thr Gly Phe val val Ser Gly Leu Val Gly 
1825 1830 1835 1840 

Ala Ala Val Gly Ser lie Gly Leu Gly Lys Val Leu Val Asp lie Leu 

1845 1850 1855 

Ala Gly Tyr Gly Ala Gly lie Ser Gly Ala Leu val Ala Phe Lys lie 

1860 1865 1870 

Met Ser Gly Glu Lys Pro Ser Met Glu Asp val Val Asn Leu Leu Pro 

1875 1880 1885 

Gly lie Leu Ser Pro Gly Ala Leu val val Gly val lie Cys Ala Ala 

1890 1895 1900 

He Leu Arg Arg His val Gly Pro Gly Glu Gly Ala Val Gin Trp Met 
1905 ~ 1910 1915 1920 

Asn Arg Leu lie Ala Phe Ala Ser Arg Gly Asn His Val Ala Pro Thr 
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1925 1930 1935 

His Tyr val Thr Glu Ser Asp Ala Ser Gin Arg val Thr Gin Leu Leu 

1940 1945 " 1950 

Gly Ser Leu Thr lie Thr Ser Leu Leu Arg Arg Leu His Asn Trp lie 

1955 1960 * 1965 

Thr Glu Asp Cys Pro lie Pro Cys Gly Gly Ser Trp Leu Arg Asp Val 

1970 1975 1980 

Trp Asp Trp Val Cys Thr lie Leu Thr Asp Phe Lys Asn Trp Leu Thr 
1985 1990 1995 2000 

Ser Lys Leu Phe Pro Lys Met Pro Gly Leu Pro Phe Val Ser Cys Gin 

2005 2010 2015 

Lys Gly Tyr Lys Gly val Trp Ala Gly Thr Gly lie Met Thr Thr Arg 

2020 2025 2030 

Cys Pro Cys Gly Ala Asn lie Ser Gly Asn val Arg Leu Gly Ser Met 

2035 2040 2045 

Arg lie Thr Gly Pro Lys Thr Cys Met Asn lie Trp Gin Gly Thr Phe 

2050 ' 2055 2060 

Pro lie Asn Cys Tyr Thr Glu Gly Gin Cys val Pro Lys Pro Ala Pro 
2065 2070 2075 2080 

Asn Phe Lys val Ala lie Trp Arg val Ala Ala Ser Glu Tyr Ala Glu 

2085 2090 2095 

Val Thr Gin His Gly Ser Tyr His Tyr lie Thr Gly Leu Thr Thr Asp 

2100 2105 2110 

Asn Leu Lys Val Pro Cys Gin Leu Pro Ser Pro Glu Phe Phe Ser Trp 

2115 2120 2125 

val Asp Gly val Gin lie His Arg Phe Ala Pro Thr Pro Lys Pro Phe 

2130 2135 2140 

Phe Arg Asp Glu val Ser Phe Cys Val Gly Leu Asn Ser Phe val val 
2145 2150 2155 2160 

Gly Ser Gin Leu Pro Cys Asp Pro Glu Pro Asp Thr Asp Val Leu Met 

2165 2170 2175 

Ser Met Leu Thr Asp Pro Ser His lie Thr Ala Glu Thr Ala Ala Arg 

2180 2185 2190 

Arg Leu Ala Arg Gly Ser Pro Pro Ser Glu Ala Ser Ser Ser Ala Ser 

2195 2200 2205 

Gin Leu Ser Ala Pro Ser Leu Arg Ala Thr Cys Thr Thr His Gly Lys 

2210 2215 2220 

Ala Tyr Asp val Asp Met val Asp Ala Asn Leu Phe Met Gly Gly Asp 
2225 2230 2235 2240 

Val Thr Arg lie Glu Ser Gly Ser Lys Val val Val Leu Asp Ser Leu 

2245 2250 2255 

Asp Pro Met val Glu Glu Arg Ser Asp Leu Glu Pro Ser lie Pro Ser 

2260 2265 2270 

Glu Tyr Met Leu Pro Lys Lys Arg Phe Pro Pro Ala Leu Pro Ala Trp 

2275 2280 2285 

Ala Arg Pro Asp Tyr Asn Pro Pro Leu Val Glu Ser Trp Lys Arg Pro 

2290 2295 2300 

Asp Tyr Gin Pro Ala Thr val Ala Gly Cys Ala Leu Pro Pro Pro Arg 
2305 2310 2315 2320 

Lys Thr Pro Thr Pro Pro Pro Arg Arg Arg Arg Thr Val Gly Leu Ser 

2325 2330 2335 

Glu Asp Ser lie Gly Asp Ala Leu Gin Gin Leu Ala lie Lys Ser Phe 

2340 2345 2350 

Gly Gin Pro Pro Pro Ser Gly Asp Ser Gly Leu Ser Thr Gly Ala Gly 

2355 2360 2365 

Ala Ala Asp Ser Gly Ser Gin Thr Pro Pro Asp Glu Leu Ala Leu Ser 

2370 ' 2375 2380 

Glu Thr Gly Ser lie Ser Ser Met Pro Pro Leu Glu Gly Glu Leu Gly 
2385 2390 2395 2400 

Asp Pro Asp Leu Glu Pro Glu Gin val Glu Pro Gin Pro Pro Pro Gin 

2405 2410 2415 

Gly Gly Val Ala Ala Pro Gly Ser Asp Ser Gly Ser Trp Ser Thr Cys 
2420 2425 2430 
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Ser Glu Glu Asp Asp Ser Val val Cys Cys Ser Met Ser Tyr Ser Trp 

2435 2440 2445 

Thr Gly Ala Leu lie Thr Pro Cys Ser Pro Glu Glu Glu Lys Leu Pro 

2450 2455 2460 

lie Asn Pro Leu Ser Asn Ser Leu Leu Arg Tyr His Asn Lys val Tyr 
2465 2470 2475 2480 

Cys Thr Thr Thr Lys Ser Ala Ser Leu Arg Ala Lys Lys Val Thr Phe 

2485 2490 2495 

Asp Arg Met Gin val Leu Asp Ser Tyr Tyr Asp Ser val Leu Lys Asp 

2500 2505 2510 

lie Lys Leu Ala Ala Ser Lys val Thr Ala Arg Leu Leu Thr Met Glu 

2515 2520 2525 

Glu Ala Cys Gin Leu Thr Pro Pro His Ser Ala Arg Ser Lys Tyr Gly 

2530 2535 2540 

Phe Gly Ala Lys Glu val Arg Ser Leu Ser Gly Arg Ala Val Asn His 
2545 2550 2555 2560 

lie Lys Ser val Trp Lys Asp Leu Leu Glu Asp Ser Glu Thr Pro lie 

2565 2570 2575 

Pro Thr Thr lie Met Ala Lys Asn Glu val Phe Cys Val Asp Pro Thr 

2580 2585 2590 

Lys Gly Gly Lys Lys Ala Ala Arg Leu lie Val Tyr Pro Asp Leu Gly 

2595 2600 2605 

val Arg val Cys Glu Lys Met Ala Leu Tyr Asp lie Thr Gin Lys Leu 

2610 2615 2620 

Pro Gin Ala val Met Gly Ala Ser Tyr Gly Phe Gin Tyr Ser Pro Ala 
2625 2630 2635 2640 

Gin Arg Val Glu Phe Leu Leu Lys Ala Trp Ala Glu Lys Lys Asp Pro 

2645 2650 2655 

Met Gly Phe Ser Tyr Asp Thr Arg Cys Phe Asp Ser Thr val Thr Glu 

2660 2665 2670 

Arg Asp lie Arg Thr Glu Glu Ser lie Tyr Arg Ala Cys Ser Leu Pro 

2675 2680 * " 2685 

Glu Glu Ala His Thr Ala lie His Ser Leu Thr Glu Arg Leu Tyr val 

2690 2695 2700 

Gly Gly Pro Met Phe Asn Ser Lys Gly Gin Thr Cys Gly Tyr Arg Arg 
2705 2710 2715 2720 

Cys Arg Ala Ser Gly val Leu Thr Thr Ser Met Gly Asn Thr lie Thr 

2725 2730 2735 

Cys Tyr val Lys Ala Leu Ala Ala Cys Lys Ala Ala Gly lie lie Ala 

2740 2745 2750 

Pro Thr Met Leu val Cys Gly Asp Asp Leu val val lie Ser Glu Ser 

2755 2760 2765 

Gin Gly Thr Glu Glu Asp Glu Arg Asn Leu Arg Ala Phe Thr Glu Ala 

2770 2775 2780 

Met Thr Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro Arg Pro Glu Tyr 
2785 ~ 2790 2795 2800 

Asp Leu Glu Leu lie Thr Ser Cys Ser Ser Asn val Ser Val Ala Leu 

2805 2810 2815 

Gly Pro Gin Gly Arg Arg Arg Tyr Tyr Leu Thr Arg Asp Pro Thr Thr 

2820 2825 2830 

Pro lie Ala Arg Ala Ala Trp Glu Thr Val Arg His Ser Pro Val Asn 

2835 2840 2845 

Ser Trp Leu Gly Asn lie lie Gin Tyr Ala Pro Thr lie Trp Ala Arg 

2850 2855 2860 

Met Val Leu Met Thr His Phe Phe Ser lie Leu Met Ala Gin Asp Thr 
2865 2870 2875 2880 

Leu Asp Gin Asn Leu Asn Phe Glu Met Tyr Gly Ala val Tyr Ser val 

2885 2890 2895 

Ser Pro Leu Asp Leu Pro Ala lie lie Glu Arg Leu His Gly Leu Asp 

2900 2905 2910 

Ala Phe Ser Leu His Thr Tyr Thr Pro His Glu Leu Thr Arg val Ala 

2915 2920 2925 

Ser Ala Leu Arg Lys Leu Gly Ala Pro Pro Leu Arg Ala Trp Lys Ser 
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2930 2935 2940 

Arg Ala Arg Ala val Arg Ala Ser Leu He Ser Arg Gly Gly Arg Ala 
2945 2950 2955 2960 

Ala val Cys Gly Arg Tyr Leu Phe Asn Trp Ala val Lys Thr Lys Leu 

2965 , 2970 2975 

Lys Leu Thr Pro Leu Pro Glu Ala Arg Leu Leu Asp Leu Ser Ser Trp 

2980 2985 2990 

Phe Thr Val Gly Ala Gly Gly Gly Asp lie Tyr His Ser val, Ser Arg 

2995 3000 3005 

Ala Arg Pro Arg Leu Leu Leu Phe Gly Leu Leu Leu Leu Phe val Gly 

3010 3015 3020 

val Gly Leu Phe Leu Leu Pro Ala Arg 
3025 3030 



<210> 6 
<211> 9599 
<212> DNA 

<213> Hepatitis C virus strain H77 



<400> 6 

gccagccccc 

tcttcacgca 

cccccctccc 

gacgaccggg 

gcaagactgc 

gtgcttgcga 

ctcaaagaaa 

gcggtcagat 

gcgcgacgag 

aggcacgtcg 

gcaatgaggg 

ggggccccac 

cgtgcggctt 

ctgccagggc 

ggaaccttcc 

tgcccgcttc 

gccctaactc 

tcccttgcgt 

ccaccaggga 

ggagcgccac 

ttggtcaact 

ctatctatcc 

cccctacggc 

tgatcgctgg 

actgggcgaa 

tcaccggggg 

ccaagcagaa 

tgaattgcaa 

tcaactcttc 

agggctgggg 

ggcactaccc 

attgcttcac 

acagctgggg 

gcaattggtt 

ccccttgtgt 

gcaaacatcc 

gcatggtcga 

tcaaagtcag 

cgcggggcga 

tgtccaccac 

ccggcctcat 

caagcatcgc 



tgatgggggc 
gaaagcgtct 
gggagagcca 
tcctttcttg 
tagccgagta 
gtgccccggg 
aaccaaacgt 
cgttggtgga 
gaagacttcc 
gcccgagggc 
ttgcgggtgg 
agacccccgg 
cgccgacctc 
cctggcgcat 
tggttgctct 
agcctaccaa 
gagtattgtg 
tcgcgagggt 
cggcaaactc 
cctctgctcg 
gtttaccttc 
cggccatata 
agcgttggtg 
tgctcactgg 
ggtcctggta 
aaatgccggc 
catccaactg 
tgaaagcctt 
aggctgtcct 
tcctatcagt 
tccaagacct 
tcccagcccc 
tgcaaatgat 
cggttgtacc 
catcggaggg 
ggaagccaca 
ctacccgtat 
gatgtacgtg 
acgctgtgat 
acagtggcag 
ccacctccac 
gtcctgggcc 



gacactccac 
agccatggcg 
tagtggtctg 
gataaacccg 
gtgttgggtc 
aggtctcgta 
aacaccaacc 
gtttacttgt 
gagcggtcgc 
aggacctggg 
gcgggatggc 
cgtaggtcgc 
atggggtaca 
ggcgtccggg 
ttctctatct 
gtgcgcaatt 
tacgaggcgg 
aacgcctcga 
cccacaacgc 
gccctctacg 
tctcccaggc 
acgggtcatc 
gtagctcagc 
ggagtcctgg 
gtgctgctgc 
cgcaccacgg 
atcaacacca 
aacaccggct 
gagaggttgg 
tatgccaacg 
tgtggcattg 
gtggtggtgg 
acggatgtct 
tggatgaact 
gtgggcaaca 
tactctcggt 
aggctttggc 
ggaggggtcg 
ctggaagaca 
gtccttccgt 
cagaacattg 
attaagtggg 



catgaatcac 
ttagtatgag 
cggaaccggt 
ctcaatgcct 
gcgaaaggcc 
gaccgtgcac 
gtcgcccaca 
tgccgcgcag 
aacctcgagg 
ctcagcccgg 
tcctgtctcc 
gcaatttggg 
taccgctcgt 
ttctggaaga 
tccttctggc 
cctcggggct 
ccgatgccat 
ggtgttgggt 
agcttcgacg 
tgggggacct 
gccactggac 
gcatggcatg 
tgctccggat 
cgggcatagc 
tatttgccgg 
ctgggcttgt 
acggcagttg 
ggttagcagg 
ccagctgccg 
gaagcggcct 
tgcccgcaaa 
gaacgaccga 
tcgtccttaa 
caactggatt 
acaccttgct 
gcggctccgg 
actatccttg 
agcacaggct 
gggacaggtc 
gttctttcac 
tggacgtgca 
agtacgtcgt 
Page 21 



tcccctgtga 
tgtcgtgcag 
gagtacaccg 
ggagatttgg 
ttgtggtact 
catgagcacg 
ggacgtcaag 
gggccctaga 
tagacgtcag 
gtacccttgg 
ccgtggctct 
taaggtcatc 
cggcgcccct 
cggcgtgaac 
cctgctctct 
ttaccatgtc 
cctgcacact 
ggcggtgacc 
tcatatcgat 
gtgcgggtct 
gacgcaagac 
ggatatgatg 
cccacaagcc 
gtatttctcc 
cgtcgacgcg 
tggtctcctt 
gcacatcaat 
gctcttctat 
acgccttacc 
cgacgaacgc 
gagcgtgtgt 
caggtcgggc 
caacaccagg 
caccaaagtg 
ctgccccact 
tccctggatt 
taccatcaat 
ggaagcggcc 
cgagctcagc 
gaccctgcca 
gtacttgtac 
tctcctgttc 



ggaactactg 
cctccaggac 
gaattgccag 
gcgtgccccc 
gcctgatagg 
aatcctaaac 
ttcccgggtg 
ttgggtgtgc 
cctatcccca 
cccctctatg 
cggcctagct 
gataccctta 
cttggaggcg 
tatgcaacag 
tgcctgactg 
accaatgatt 
ccggggtgtg 
cccacggtgg 
ctgcttgtcg 
gtctttcttg 
tgcaattgtt 
atgaactggt 
atcatggaca 
atggtgggga 
gaaacccacg 
acaccaggcg 
agcacggcct 
caacacaaat 
gattttgccc 
ccctactgct 
ggcccggtat 
gcgcctacct 
ccaccgctgg 
tgcggagcgc 
gattgcttcc 
acacccaggt 
tacaccatat 
tgcaactgga 
ccgttgctgc 
gccttgtcca 
ggggtagggt 
cttctgcttg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 



cagacgcgcg 
ctttggagaa 
ccttcctcgt 
tctacgccct 
catacgcact 
tggcgctgac 
agtattttct 

ggggggggcg 

acatcaccaa 
ttaaagtccc 
agatagccgg 
cctatgtgta 
tggccgtggc 
gggcagatac 
gccaggagat 
cgcccatcac 
tgactggccg 
aaaccttcct 
cgaggaccat 
accttgtggg 
cctcggacct 
atagcagggg 
gtccgctgtt 
gtggagtggc 
ccccggtgtt 
acctgcatgc 
agggctacaa 
acatgtccaa 
ctggcagccc 
gaggtgctta 
tgggcatcgg 
ccactgctac 
tgtccaccac 
ggggaagaca 
tggtcgcatt 
cgaccagcgg 
acttcgactc 
accctacctt 
gccggggcag 
gcccctccgg 
ggtatgagct 
ggcttcccgt 
atatagatgc 
tagcgtacca 
tgtggaagtg 
gactgggcgc 
catgcatgtc 
tggctgctct 
tcttgtccgg 
agatggaaga 
agttcaagca 
cccctgctgt 
atttcatcag 
ttgcttcatt 
tcctcttcaa 
ctgcctttgt 
tcctcgtgga 
agatcatgag 
tctcgcctgg 
gcccgggcga 
accatgtttc 
tactcagcag 
agtgtaccac 



cgtctgctcc 
cctcgtaata 
gttcttctgc 
ctacgggatg 
ggacacggag 
tctgtcgcca 
gaccagagta 
cgatgccgtc 
actactcctg 
ctacttcgtg 
aggtcattac 
taaccatctc 
tgtggaacca 
cgccgcgtgc 
actgcttggg 
ggcgtacgcc 
ggacaaaaac 
ggcaacgtgc 
cgcatcaccc 
ctggcccgct 
ttacctggtc 
tagcctgctt 
gtgccccgcg 
taaagcggtg 
cacggacaac 
tcccaccggc 
ggtgttggtg 
ggcccatggg 
catcacgtac 
tgacataata 
cactgtcctt 
ccctccgggc 
cggagagatc 
tctcatcttc 
gggcatcaat 
cgatgttgtc 
tgtgatagac 
taccattgag 
gactggcagg 
catgttcgac 
cacgcccgcc 
gtgccaggac 
ccacttttta 
agccaccgtg 
tttgatccgc 
tgttcagaat 
ggccgacctg 
ggccgcgtat 
gaagccggca 
gtgctctcag 
gaaggccctc 
ccagaccaac 
tgggatacaa 
gatggctttt 
catattgggg 

gggtgctggc 

cattcttgca 
cggtgaggtc 
agcccttgta 
gggggcagtg 
ccccacgcac 
cctcactgta 
tccatgctcc 
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tgcttgtgga tgatgttact catatcccaa gcggaggcgg 2580 

ctcaatgcag catccctggc cgggacgcac ggtcttgtgt 2640 

tttgcgtggt atctgaaggg taggtgggtg cccggagcgg 2700 

tggcctctcc tcctgctcct gctggcgttg cctcagcggg 2760 

gtggccgcgt cgtgtggcgg cgttgttctt gtcgggttaa 2820 

tattacaagc gctatatcag ctggtgcatg tggtggcttc 2880 

gaagcgcaac tgcacgtgtg ggttcccccc ctcaacgtcc 2940 

atcttactca tgtgtgtagt acacccgacc ctggtatttg 3000 

gccatcttcg gacccctttg gattcttcaa gccagtttgc 3060 

cgcgttcaag gccttctccg gatctgcgcg ctagcgcgga 3120 

gtgcaaatgg ccatcatcaa gttaggggcg cttactggca 3180 

acccctcttc gagactgggc gcacaacggc ctgcgagatc 3240 

gtcgtcttct cccgaatgga gaccaagctc atcacgtggg 3300 

ggtgacatca tcaacggctt gcccgtctct gcccgtaggg 3360 

ccagccgacg gaatggtctc caaggggtgg aggttgctgg 3420 

cagcagacga gaggcctcct agggtgtata atcaccagcc 3480 

caagtggagg gtgaggtcca gatcgtgtca actgctaccc 3540 

atcaatgggg tatgctggac tgtctaccac ggggccggaa 3600 

aagggtcctg tcatccagat gtataccaat gtggaccaag 3660 

cctcaaggtt cccgctcatt gacaccctgt acctgcggct 3720 

acgaggcacg ccgatgtcat tcccgtgcgc cggcgaggtg 3780 

tcgccccggc ccatttccta cttgaaaggc tcctcggggg 3840 

ggacacgccg tgggcctatt cagggccgcg gtgtgcaccc 3900 

gactttatcc ctgtggagaa cctagggaca accatgagat 3960 

tcctctccac cagcagtgcc ccagagcttc caggtggccc 4020 

agcggtaaga gcaccaaggt cccggctgcg tacgcagccc 4080 

ctcaacccct ctgttgctgc aacgctgggc tttggtgctt 4140 

gttgatccta atatcaggac cggggtgaga acaattacca 4200 

tccacctacg gcaagttcct tgccgacggc gggtgctcag 4260 

atttgtgacg agtgccactc cacggatgcc acatccatct 4320 

gaccaagcag agactgcggg ggcgagactg gttgtgctcg 4380 

tccgtcactg tgtcccatcc taacatcgag gaggttgctc 4440 

cccttttacg gcaaggctat ccccctcgag gtgatcaagg 4500 

tgccactcaa agaagaagtg cgacgagctc gccgcgaagc 4560 

gccgtggcct actaccgcgg tcttgacgtg tctgtcatcc 4620 

gtcgtgtcga ccgatgctct catgactggc tttaccggcg 4680 

tgcaacacgt gtgtcactca gacagtcgat ttcagccttg 4740 

acaaccacgc tcccccagga tgctgtctcc aggactcaac 4800 

gggaagccag gcatctatag atttgtggca ccgggggagc 4860 

tcgtccgtcc tctgtgagtg ctatgacgcg ggctgtgctt 4920 

gagactacag ttaggctacg agcgtacatg aacaccccgg 4980 

catcttgaat tttgggaggg cgtctttacg ggcctcactc 5040 

tcccagacaa agcagagtgg ggagaacttt ccttacctgg 5100 

tgcgctaggg ctcaagcccc tcccccatcg tgggaccaga 5160 

cttaaaccca ccctccatgg gccaacaccc ctgctataca 5220 

gaagtcaccc tgacgcaccc aatcaccaaa tacatcatga 5280 

gaggtcgtca cgagcacctg ggtgctcgtt ggcggcgtcc 5340 

tgcctgtcaa caggctgcgt ggtcatagtg ggcaggatcg 5400 

attatacctg acagggaggt tctctaccag gagttcgatg 5460 

cacttaccgt acatcgagca agggatgatg ctcgctgagc 5520 

ggcctcctgc agaccgcgtc ccgccatgca gaggttatca 5580 

tggcagaaac tcgaggtctt ttgggcgaag cacatgtgga 5640 

tacttggcgg gcctgtcaac gctgcctggt aaccccgcca 5700 

acagctgccg tcaccagccc actaaccact ggccaaaccc 5760 

gggtgggtgg ctgcccagct cgccgccccc ggtgccgcta 5820 

ctagctggcg ccgccatcgg cagcgttgga ctggggaagg 5880 

gggtatggcg cgggcgtggc gggagctctt gtagcattca 5940 

ccctccacgg aggacctggt caatctgctg cccgccatcc 6000 

gtcggtgtgg tctgcgcagc aatactgcgc cggcacgttg 6060 

caatggatga accggctaat agccttcgcc tcccggggga 6120 

tacgtgccgg agagcgatgc agccgcccgc gtcactgcca 6180 

acccagctcc tgaggcgact gcatcagtgg ataagctcgg 6240 

ggttcctggc taagggacat ctgggactgg atatgcgagg 6300 
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tgctgagcga ctttaagacc tggctgaaag ccaagctcat gccacaactg cctgggattc 6360 
cctttgtgtc ctgccagcgc gggtataggg gggtctggcg aggagacggc attatgcaca 6420 
ctcgctgcca ctgtggagct gagatcactg gacatgtcaa aaacgggacg atgaggatcg 6480 
tcggtcctag gacctgcagg aacatgtgga gtgggacgtt ccccattaac gcctacacca 6540 
cgggcccctg tactcccctt cctgcgccga actataagtt cgcgctgtgg agggtgtctg 6600 
cagaggaata cgtggagata aggcgggtgg gggacttcca ctacgtatcg ggtatgacta 6660 
ctgacaatct taaatgcccg tgccagatcc catcgcccga atttttcaca gaattggacg 6720 
gggtgcgcct acacaggttt gcgccccctt gcaagccctt gctgcgggag gaggtatcat 6780 
tcagagtagg actccacgag tacccggtgg ggtcgcaatt accttgcgag cccgaaccgg 6840 
acgtagccgt gttgacgtcc atgctcactg atccctccca tataacagca gaggcggccg 6900 
ggagaaggtt ggcgagaggg tcaccccctt ctatggccag ctcctcggct agccagctgt 6960 
ccgctccatc tctcaaggca acttgcaccg ccaaccatga ctcccctgac gccgagctca 7020 
tagaggctaa cctcctgtgg aggcaggaga tgggcggcaa catcaccagg gttgagtcag 7080 
agaacaaagt ggtgattctg gactccttcg atccgcttgt ggcagagga^ gatgagcggg 7140 
aggtctccgt acctgcagaa attctgcgga agtctcggag attcgcccgg gccctgcccg 7200 
tctgggcgcg gccggactac aaccccccgc tagtagagac gtggaaaaag cctgactacg 7260 
aaccacctgt ggtccatggc tgcccgctac cacctccacg gtcccctcct gtgcctccgc 7320 
ctcggaaaaa gcgtacggtg gtcctcaccg aatcaaccct atctactgcc ttggccgagc 7380 
ttgccaccaa aagttttggc agctcctcaa cttccggcat tacgggcgac aatacgacaa 7440 
catcctctga gcccgcccct tctggctgcc cccccgactc cgacgttgag tcctattctt 7500 
ccatgccccc cctggagggg gagcctgggg atccggatct cagcgacggg tcatggtcga 7560 
cggtcagtag tggggccgac acggaagatg tcgtgtgctg ctcaatgtct tattcctgga 7620 
caggcgcact cgtcaccccg tgcgctgcgg aagaacaaaa actgcccatc aacgcactga 7680 
gcaactcgtt gctacgccat cacaatctgg tgtattccac cacttcacgc agtgcttgcc 7740 
aaaggcagaa gaaagtcaca tttgacagac tgcaagttct ggacagccat taccaggacg 7800 
tgctcaagga ggtcaaagca gcggcgtcaa aagtgaaggc taacttgcta tccgtagagg 7860 
aagcttgcag cctgacgccc ccacattcag ccaaatccaa gtttggctat ggggcaaaag 7920 
acgtccgttg ccatgccaga aaggccgtag cccacatcaa ctccgtgtgg aaagaccttc 7980 
tggaagacag tgtaacacca atagacacta ccatcatggc caagaacgag gttttctgcg 8040 
ttcagcctga gaaggggggt cgtaagccag ctcgtctcat cgtgttcccc gacctgggcg 8100 
tgcgcgtgtg cgagaagatg gccctgtacg acgtggttag caagctcccc ctggccgtga 8160 
tgggaagctc ctacggattc caatactcac caggacagcg ggttgaattc ctcgtgcaag 8220 
cgtggaagtc caagaagacc ccgatggggt tctcgtatga tacccgctgt tttgactcca 8280 
cagtcactga gagcgacatc cgtacggagg aggcaattta ccaatgttgt gacctggacc 8340 
cccaagcccg cgtggccatc aagtccctca ctgagaggct ttatgttggg ggccctctta 8400 
ccaattcaag gggggaaaac tgcggctacc gcaggtgccg cgcgagcggc gtactgacaa 8460 
ctagctgtgg taacaccctc acttgctaca tcaaggcccg ggcagcctgt cgagccgcag 8520 
ggctccagga ctgcaccatg ctcgtgtgtg gcgacgactt agtcgttatc tgtgaaagtg 8580 
cgggggtcca ggaggacgcg gcgagcctga gagccttcac ggaggctatg accaggtact 8640 
ccgccccccc cggggacccc ccacaaccag aatacgactt ggagcttata acatcatgct 8700 
cctccaacgt gtcagtcgcc cacgacggcg ctggaaagag ggtctactac cttacccgtg 8760 
accctacaac ccccctcgcg agagccgcgt gggagacagc aagacacact ccagtcaatt 8820 
cctggctagg caacataatc atgtttgccc ccacactgtg ggcgaggatg atactgatga 8880 
cccatttctt tagcgtcctc atagccaggg atcagcttga acaggctctt aactgtgaga 8940 
tctacggagc ctgctactcc atagaaccac tggatctacc tccaatcatt caaagactcc 9000 
atggcctcag cgcattttca ctccacagtt actctccagg tgaaatcaat agggtggccg 9060 
catgcctcag aaaacttggg gtcccgccct tgcgagcttg gagacaccgg gcccggagcg 9120 
tccgcgctag gcttctgtcc agaggaggca gggctgccat atgtggcaag tacctcttca 9180 
actgggcagt aagaacaaag ctcaaactca ctccaatagc ggccgctggc cggctggact 9240 
tgtccggttg gttcacggct ggctacagcg ggggagacat ttatcacagc gtgtctcatg 9300 
cccggccccg ctggttctgg ttttgcctac tcctgctcgc tgcaggggta ggcatctacc 9360 
tcctccccaa ccgatgaagg ttggggtaaa cactccggcc tcttaagcca tttcctgttt 9420 
tttttttttt tttttttttt tttttctttt tttttttctt tcctttcctt ctttttttcc 9480 
tttctttttc ccttctttaa tggtggctcc atcttagccc tagtcacggc tagctgtgaa 9540 
aggtccgtga gccgcatgac tgcagagagt gctgatactg gcctctctgc agatcatgt 9599 

<210> 7 
<211> 3011 
<212> PRT 

<213> Hepatitis C virus strain H77 
<400> 7 

Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 
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15 10 15 

Arq Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin lie val Gly 

20 25 30 

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly val Arg Ala 

35 40 45 

Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 

50 55 60 

lie Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gin Pro Gly 
65 70 75 80 

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp 

85 90 95 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro 

100 " 105 110 

Arg Arg Arg Ser Arg Asn Leu Gly Lys Val lie Asp Thr Leu Thr Cys 

115 120 125 

Gly Phe Ala Asp Leu Met Gly Tyr lie Pro Leu Val Gly Ala Pro Leu 

130 135 140 

Gly Gly Ala Ala Arg Ala Leu Ala His Gly val Arg val Leu Glu A$p 
145 150 155 160 

Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser lie 

165 170 175 

Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr 

180 185 190 

Gin Val Arg Asn Ser Ser Gly Leu Tyr His Val Thr Asn Asp Cys Pro 

195 200 205 

Asn Ser Ser lie Val Tyr Glu Ala Ala Asp Ala lie Leu His Thr Pro 

210 215 220 

Gly Cys val Pro Cys val Arg Glu Gly Asn Ala Ser Arg Cys Trp Val 
225 230 235 240 

Ala val Thr Pro Thr Val Ala Thr Arg Asp Gly Lys Leu Pro Thr Thr 

245 250 255 

Gin Leu Arg Arg His lie Asp Leu Leu Val Gly Ser Ala Thr Leu Cys 

260 265 270 

Ser Ala Leu Tyr val Gly Asp Leu Cys Gly Ser Val Phe Leu Val Gly 

275 280 285 

Gin Leu Phe Thr Phe Ser Pro Arg Arg His Trp Thr Thr Gin Asp Cys 

290 295 300 

Asn Cys Ser lie Tyr Pro Gly His lie Thr Gly His Arg Met Ala Trp 
305 310 315 320 

Asp Met Met Met Asn Trp Ser Pro Thr Ala Ala Leu val val Ala Gin 

325 330 335 

Leu Leu Arg lie Pro Gin Ala lie Met Asp Met lie Ala Gly Ala His 

340 345 350 

Trp Gly Val Leu Ala Gly lie Ala Tyr Phe Ser Met val Gly Asn Trp 

355 360 365 

Ala Lys val Leu val val Leu Leu Leu Phe Ala Gly val Asp Ala Glu 

370 375 380 

Thr His Val Thr Gly Gly Asn Ala Gly Arg Thr Thr Ala Gly Leu val 
385 390 395 400 

Gly Leu Leu Thr Pro Gly Ala Lys Gin Asn lie Gin Leu lie Asn Thr 

405 410 415 

Asn Gly Ser Trp His lie Asn Ser Thr Ala Leu Asn Cys Asn Glu Ser 

420 425 430 

Leu Asn Thr Gly Trp Leu Ala Gly Leu Phe Tyr Gin His Lys Phe Asn 

435 440 445 

Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg Arg Leu Thr Asp 

450 455 460 

Phe Ala Gin Gly Trp Gly Pro lie Ser Tyr Ala Asn Gly Ser Gly Leu 
465 470 475 480 

Asp Glu Arg Pro Tyr Cys Trp His Tyr Pro Pro Arg Pro Cys Gly lie 

485 490 495 

val Pro Ala Lys Ser Val Cys Gly Pro val Tyr Cys Phe Thr Pro Ser 
500 505 510 
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Pro val Val val Gly Thr Thr Asp Arg Ser Gly Ala Pro Thr Tyr Ser 

515 520 525 

Trp Gly Ala Asn Asp Thr Asp val Phe val Leu Asn Asn Thr Arg Pro 

530 535 540 

pro Leu Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Ser Thr Gly Phe 
545 550 555 560 

Thr Lys Val Cys Gly Ala Pro Pro Cys Val He Gly Gly Val Gly Asn 

565 570 575 

Asn Thr Leu Leu Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala 

580 585 590 

Thr Tyr Ser Arg Cys Gly Ser Gly Pro Trp lie Thr Pro Arg Cys Met 

595 600 605 

val Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr lie Asn Tyr 

610 615 620 

Thr He Phe Lys val Arg Met Tyr val Gly Gly Val Glu His Arg Leu 
625 630 635 640 

Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp 

645 650 655 

Arq Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Gin Trp 

660 665 670 

Gin val Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly 

675 680 685 

Leu lie His Leu His Gin Asn lie val Asp val Gin Tyr Leu Tyr Gly 

690 695 700 

val Gly Ser Ser lie Ala Ser Trp Ala lie Lys Trp Glu Tyr val val 
705 710 715 720 

Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg val Cys Ser Cys Leu Trp 

725 730 735 

Met Met Leu Leu lie Ser Gin Ala Glu Ala Ala Leu Glu Asn Leu val 

740 745 750 

lie Leu Asn Ala Ala Ser Leu Ala Gly Thr His Gly Leu val Ser Phe 

755 760 765 

Leu val Phe Phe Cys Phe Ala Trp Tyr Leu Lys Gly Arg Trp val Pro 

770 ' 775 780 

Gly Ala val Tyr Ala Leu Tyr Gly Met Trp Pro Leu Leu Leu Leu Leu 
785 790 795 800 

Leu Ala Leu Pro Gin Arg Ala Tyr Ala Leu Asp Thr Glu val Ala Ala 

805 810 815 

Ser Cys Gly Gly val val Leu val Gly Leu Met Ala Leu Thr Leu Ser 

820 825 830 

Pro Tyr Tyr Lys Arg Tyr lie Ser Trp Cys Met Trp Trp Leu Gin Tyr 

835 " ' 840 845 

Phe Leu Thr Arg val Glu Ala Gin Leu His val Trp val Pro Pro Leu 

850 ~ 855 860 

Asn Val Arg Gly Gly Arg Asp Ala Val lie Leu Leu Met Cys Val Val 
865 870 875 880 

His Pro Thr Leu val Phe Asp lie Thr Lys Leu Leu Leu Ala lie Phe 

885 890 895 

Gly Pro Leu Trp lie Leu Gin Ala Ser Leu Leu Lys val Pro Tyr Phe 

900 905 910 

val Arg val Gin Gly Leu Leu Arg lie Cys Ala Leu Ala Arg Lys lie 

915 920 925 

Ala Gly Gly His Tyr val Gin Met Ala lie lie Lys Leu Gly Ala Leu 

930 935 940 

Thr Gly Thr Tyr val Tyr Asn His Leu Thr Pro Leu Arg Asp Trp Ala 
945 950 955 960 

His Asn Gly Leu Arg Asp Leu Ala val Ala val Glu Pro val val Phe 

965 970 975 

Ser Arg Met Glu Thr Lys Leu lie Thr Trp Gly Ala Asp Thr Ala Ala 

980 985 990 

Cys Gly Asp lie lie Asn Gly Leu Pro val Ser Ala Arg Arg Gly Gin 

995 1000 1005 

Glu lie Leu Leu Gly Pro Ala Asp Gly Met val Ser Lys Gly Trp Arg 

page 25 



2540109_1.TXT 
1010 1015 1020 

Leu Leu Ala Pro lie Thr Ala Tyr Ala Gin Gin Thr Arg Gly Leu Leu 
1025 1030 1035 1040 

Gly Cys lie lie Thr Ser Leu Thr Gly Arg Asp Lys Asn Gin val Glu 

1045 1050 1055 

Gly Glu val Gin lie val Ser Thr Ala Thr Gin Thr Phe Leu Ala Thr 

1060 1065 1070 

Cys lie Asn Gly Val Cys Trp Thr Val Tyr His Gly Ala Gly Thr Arg 

1075 1080 1085 

Thr lie Ala Ser Pro Lys Gly Pro val lie Gin Met Tyr Thr Asn Val 

1090 1095 1100 

Asp Gin Asp Leu Val Gly Trp Pro Ala Pro Gin Gly Ser Arg Ser Leu 
1105 1110 1115 " 1120 

Thr Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu val Thr Arg His 

1125 1130 1135 

Ala Asp val He Pro val Arg Arg Arg Gly Asp Ser Arg Gly Ser Leu 

1140 ~ 1145 1150 

Leu Ser Pro Arg Pro lie Ser Tyr Leu Lys Gly Ser Ser Gly Gly Pro 

1155 1160 1165 

Leu Leu Cys Pro Ala Gly His Ala val Gly Leu Phe Arg Ala Ala val 

1170 1175 1180 

Cys Thr Arg Gly val Ala Lys Ala val Asp Phe lie Pro Val Glu Asn 
1185 ~ 1190 1195 1200 

Leu Gly Thr Thr Met Arg Ser Pro val Phe Thr Asp Asn Ser Ser Pro 

1205 1210 1215 

Pro Ala val Pro Gin Ser Phe Gin val Ala His Leu His Ala Pro Thr 

1220 1225 1230 

Gly Ser Gly Lys Ser Thr Lys Val Pro Ala Ala Tyr Ala Ala Gin Gly 

1235 1240 1245 

Tyr Lys val Leu val Leu Asn Pro Ser val Ala Ala Thr Leu Gly Phe 

1250 1255 1260 

Gly Ala Tyr Met Ser Lys Ala His Gly val Asp Pro Asn lie Arg Thr 
1265 1270 1275 1280 

Gly val Arg Thr lie Thr Thr Gly Ser Pro lie Thr Tyr Ser Thr Tyr 

1285 1290 1295 

Gly Lys Phe Leu Ala Asp Gly Gly Cys Ser Gly Gly Ala Tyr Asp lie 

1300 1305 1310 

lie lie Cys Asp Glu Cys His Ser Thr Asp Ala Thr Ser lie Leu Gly 

1315 1320 1325 

lie Gly Thr val Leu Asp Gin Ala Glu Thr Ala Gly Ala Arg Leu val 

1330 1335 1340 

val Leu Ala Thr Ala Thr Pro Pro Gly Ser val Thr val Ser His Pro 
1345 1350 1355 1360 

Asn lie Glu Glu Val Ala Leu Ser Thr Thr Gly Glu lie Pro Phe Tyr 

1365 1370 1375 

Gly Lys Ala lie Pro Leu Glu Val lie Lys Gly Gly Arg His Leu lie 

1380 1385 1390 

Phe Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala Ala Lys Leu val 

1395 1400 1405 

Ala Leu Gly lie Asn Ala Val Ala Tyr Tyr Arg Gly Leu Asp Val Ser 

1410 1415 1420 

val lie Pro Thr Ser Gly Asp val val val Val Ser Thr Asp Ala Leu 
1425 1430 1435 1440 

Met Thr Gly Phe Thr Gly Asp Phe Asp Ser Val lie Asp Cys Asn Thr 

1445 1450 1455 

Cys val Thr Gin Thr val Asp Phe Ser Leu Asp Pro Thr Phe Thr lie 

1460 1465 1470 

Glu Thr Thr Thr Leu Pro Gin Asp Ala val Ser Arg Thr Gin Arg Arg 

1475 1480 1485 

Gly Arg Thr Gly Arg Gly Lys Pro Gly lie Tyr Arg Phe val Ala Pro 

1490 1495 1500 

Gly Glu Arg Pro Ser Gly Met Phe Asp Ser Ser val Leu Cys Glu Cys 
1505 1510 1515 " 1520 
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Tyr Asp Ala Gly Cys Ala Trp Tyr Glu Leu Thr Pro Ala Glu Thr Thr 

. 1525 1530 1535 

val Arg Leu Arg Ala Tyr Met Asn Thr Pro Gly Leu Pro val Cys Gin 

1540 1545 1550 

Asp His Leu Glu Phe Trp Glu Gly Val Phe Thr Gly Leu Thr His lie 

1555 1560 1565 

Asp Ala His Phe Leu Ser Gin Thr Lys Gin Ser Gly Glu Asn Phe Pro 

1570 1575 1580 

Tyr Leu Val Ala Tyr Gin Ala Thr Val Cys Ala Arg Ala Gin Ala Pro 
1585 1590 1595 1600 

Pro Pro Ser Trp Asp Gin Met Trp Lys Cys Leu lie Arg Leu Lys Pro 

1605 1610 1615 

Thr Leu His Gly Pro Thr Pro Leu Leu Tyr Arg Leu Gly Ala val Gin 

1620 1625 ~ 1630 

Asn Glu val Thr Leu Thr His Pro lie Thr Lys Tyr lie Met Thr Cys 

1635 1640 1645 

Met Ser Ala Asp Leu Glu val val Thr Ser Thr Trp Val Leu val Gly 

1650 1655 1660 

Gly val Leu Ala Ala Leu Ala Ala Tyr Cys Leu Ser Thr Gly Cys val 
1665 1670 1675 1680 

val lie val Gly Arg lie val Leu Ser Gly Lys Pro Ala lie lie Pro 

1685 1690 1695 

Asp Arg Glu val Leu Tyr Gin Glu Phe Asp Glu Met Glu Glu Cys Ser 

1700 1705 1710 

Gin His Leu Pro Tyr lie Glu Gin Gly Met Met Leu Ala Glu Gin Phe 

1715 1720 1725 

Lys Gin Lys Ala Leu Gly Leu Leu Gin Thr Ala Ser Arg His Ala Glu 

1730 1735 1740 

val lie Thr Pro Ala val Gin Thr Asn Trp Gin Lys Leu Glu val Phe 
1745 1750 1755 1760 

Trp Ala Lys His Met Trp Asn Phe lie Ser Gly lie Gin Tyr Leu Ala 

1765 1770 1775 

Gly Leu Ser Thr Leu Pro Gly Asn Pro Ala lie Ala Ser Leu Met Ala 

1780 1785 1790 

Phe Thr Ala Ala Val Thr Ser Pro Leu Thr Thr Gly Gin Thr Leu Leu 

1795 1800 1805 

Phe Asn lie Leu Gly Gly Trp val Ala Ala Gin Leu Ala Ala Pro Gly 

1810 1815 1820 

Ala Ala Thr Ala Phe val Gly Ala Gly Leu Ala Gly Ala Ala lie Gly 
1825 1830 1835 1840 

Ser val Gly Leu Gly Lys val Leu val Asp lie Leu Ala Gly Tyr Gly 

1845 1850 1855 

Ala Gly val Ala Gly Ala Leu val Ala Phe Lys lie Met Ser Gly Glu 

1860 1865 1870 

val Pro Ser Thr Glu Asp Leu val Asn Leu Leu Pro Ala lie Leu Ser 

1875 1880 1885 

pro Gly Ala Leu Val val Gly val val Cys Ala Ala lie Leu Arg Arg 

1890 1895 1900 

His val Gly Pro Gly Glu Gly Ala val Gin Trp Met Asn Arg Leu He 
1905 1910 1915 ~ 1920 

Ala Phe Ala Ser Arg Gly Asn His val Ser Pro Thr His Tyr Val Pro 

1925 1930 1935 

Glu Ser Asp Ala Ala Ala Arg val Thr Ala lie Leu Ser Ser Leu Thr 

1940 1945 1950 

val Thr Gin Leu Leu Arg Arg Leu His Gin Trp lie Ser Ser Glu Cys 

1955 " 1960 1965 

Thr Thr Pro Cys Ser Gly Ser Trp Leu Arg Asp lie Trp Asp Trp lie 

1970 1975 1980 

Cys Glu val Leu Ser Asp Phe Lys Thr Trp Leu Lys Ala Lys Leu Met 
1985 1990 1995 2000 

pro Gin Leu Pro Gly lie Pro Phe val Ser Cys Gin Arg Gly Tyr Arg 

2005 2010 ~ 2015 

Gly val Trp Arg Gly Asp Gly lie Met His Thr Arg Cys His Cys Gly 
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2020 2025 2030 

Ala Glu lie Thr Gly His Val Lys Asn Gly Thr Met Arg lie val Gly 

2035 2040 2045 

Pro Arg Thr Cys Arg Asn Met Trp Ser Gly Thr Phe Pro lie Asn Ala 

2050 2055 2060 

Tyr Thr Thr Gly Pro Cys Thr Pro Leu Pro Ala Pro Asn Tyr Lys Phe 
2065 2070 2075 2080 

Ala Leu Trp Arg val Ser Ala Glu Glu Tyr val Glu lie Arg Arg val 

2085 2090 2095 

Gly Asp Phe His Tyr Val Ser Gly Met Thr Thr Asp Asn Leu Lys Cys 

2100 2105 2110 

Pro Cys Gin lie Pro Ser Pro Glu Phe Phe Thr Glu Leu Asp Gly Val 

2115 2120 2125 

Arg Leu His Arg Phe Ala Pro Pro Cys Lys Pro Leu Leu Arg Glu Glu 

2130 " 2135 2140 

Val Ser Phe Arg Val Gly Leu His Glu Tyr Pro Val Gly Ser Gin Leu 
2145 " 2150 2155 2160 

Pro Cys Glu Pro Glu Pro Asp Val Ala Val Leu Thr Ser Met Leu Thr 

2165 2170 2175 

Asp Pro Ser His lie Thr Ala Glu Ala Ala Gly Arg Arg Leu Ala Arg 

2180 2185 2190 

Gly Ser Pro Pro Ser Met Ala Ser Ser Ser Ala Ser Gin Leu Ser Ala 

2195 2200 2205 

Pro Ser Leu Lys Ala Thr Cys Thr Ala Asn His Asp Ser Pro Asp Ala 

2210 2215 2220 

Glu Leu lie Glu Ala Asn Leu Leu Trp Arg Gin Glu Met Gly Gly Asn 
2225 2230 2235 2240 

lie Thr Arg val Glu Ser Glu Asn Lys Val val lie Leu Asp Ser Phe 

2245 2250 2255 

Asp Pro Leu Val Ala Glu Glu Asp Glu Arg Glu val Ser Val Pro Ala 

2260 2265 2270 

Glu lie Leu Arg Lys Ser Arg Arg Phe Ala Arg Ala Leu Pro Val Trp 

2275 2280 2285 

Ala Arg Pro Asp Tyr Asn Pro Pro Leu Val Glu Thr Trp Lys Lys Pro 

2290 2295 2300 

Asp Tyr Glu Pro Pro val Val His Gly Cys Pro Leu Pro Pro Pro Ar 
2305 2310 2315 232 

Ser Pro Pro val Pro Pro Pro Arg Lys Lys Arg Thr Val val Leu Thr 

2325 2330 2335 

Glu Ser Thr Leu Ser Thr Ala Leu Ala Glu Leu Ala Thr Lys Ser Phe 

2340 2345 2350 

Gly Ser Ser Ser Thr Ser Gly lie Thr Gly Asp Asn Thr Thr Thr Ser 

2355 2360 2365 

Ser Glu Pro Ala Pro Ser Gly Cys Pro Pro Asp Ser Asp val Glu Ser 

2370 2375 2380 

Tyr Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu 
2385 2390 2395 2400 

Ser Asp Gly Ser Trp Ser Thr val Ser Ser Gly Ala Asp Thr Glu Asp 

2405 2410 2415 

Val val Cys Cys Ser Met Ser Tyr Ser Trp Thr Gly Ala Leu val Thr 

2420 2425 2430 

Pro Cys Ala Ala Glu Glu Gin Lys Leu Pro lie Asn Ala Leu Ser Asn 

2435 2440 2445 

Ser Leu Leu Arg His His Asn Leu Val Tyr Ser Thr Thr Ser Arg Ser 

2450 2455 * 2460 

Ala Cys Gin Arg Gin Lys Lys val Thr Phe Asp Arg Leu Gin val Leu 
2465 " 2470 2475 2480 

Asp Ser His Tyr Gin Asp val Leu Lys Glu val Lys Ala Ala Ala Ser 

2485 2490 2495 

Lys val Lys Ala Asn Leu Leu Ser Val Glu Glu Ala Cys Ser Leu Thr 

2500 2505 2510 

Pro Pro His Ser Ala Lys Ser Lys Phe Gly Tyr Gly Ala Lys Asp val 
2515 2520 2525 
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Arg Cys His Ala Arg Lys Ala val Ala His lie Asn Ser val Trp Lys 

2530 2535 2540 

Asp Leu Leu Glu Asp Ser Val Thr Pro lie Asp Thr Thr lie Met Ala 
2545 2550 2555 2560 

Lys Asn Glu val Phe Cys Val Gin Pro Glu Lys Gly Gly Arg Lys Pro 

2565 2570 2575 

Ala Arg Leu lie val Phe Pro Asp Leu Gly Val Arg val Cys Glu Lys 

2580 2585 ~ 2590 

Met Ala Leu Tyr Asp Val val Ser Lys Leu Pro Leu Ala val Met Gly 

2595 2600 2605 

Ser Ser Tyr Gly Phe Gin Tyr Ser Pro Gly Gin Arg Val Glu Phe Leu 

2610 2615 2620 

val Gin Ala Trp Lys Ser Lys Lys Thr Pro Met Gly Phe Ser Tyr Asp 
2625 2630 2635 2640 

Thr Arg Cys Phe Asp Ser Thr Val Thr Glu Ser Asp lie Arg Thr Glu 

2645 2650 2655 

Glu Ala lie Tyr Gin Cys Cys Asp Leu Asp Pro Gin Ala Arg val Ala 

2660 2665 2670 

lie Lys Ser Leu Thr Glu Arg Leu Tyr val Gly Gly Pro Leu Thr Asn 

2675 2680 2685 

Ser Arg Gly Glu Asn Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly val 

2690 2695 2700 

Leu Thr Thr Ser Cys Gly Asn Thr Leu Thr Cys Tyr lie Lys Ala Arg 
2705 2710 2715 2720 

Ala Ala Cys Arg Ala Ala Gly Leu Gin Asp Cys Thr Met Leu Val Cys 

2725 2730 2735 

Gly Asp Asp Leu val val lie Cys Glu Ser Ala Gly val Gin Glu Asp 

2740 2745 2750 

Ala Ala Ser Leu Arg Ala Phe Thr Glu Ala Met Thr Arg Tyr Ser Ala 

2755 2760 2765 

Pro Pro Gly Asp Pro Pro Gin Pro Glu Tyr Asp Leu Glu Leu lie Thr 

2770 2775 2780 

Ser Cys Ser Ser Asn val Ser val Ala His Asp Gly Ala Gly Lys Arg 
2785 2790 2795 2800 

val Tyr Tyr Leu Thr Arg Asp Pro Thr Thr Pro Leu Ala Arg Ala Ala 

2805 2810 2815 

Trp Glu Thr Ala Arg His Thr Pro val Asn Ser Trp Leu Gly Asn lie 

2820 2825 2830 

lie Met Phe Ala Pro Thr Leu Trp Ala Arg Met lie Leu Met Thr His 

2835 2840 ~ 2845 

Phe Phe Ser val Leu lie Ala Arg Asp Gin Leu Glu Gin Ala Leu Asn 

2850 2855 2860 

Cys Glu lie Tyr Gly Ala Cys Tyr Ser lie Glu Pro Leu Asp Leu Pro 
2865 2870 2875 2880 

Pro lie lie Gin Arg Leu His Gly Leu Ser Ala Phe Ser Leu His Ser 

2885 2890 2895 

Tyr Ser Pro Gly Glu lie Asn Arg val Ala Ala Cys Leu Arg Lys Leu 

2900 2905 2910 

Gly val Pro Pro Leu Arg Ala Trp Arg His Arg Ala Arg Ser val Arg 

2915 " 2920 2925 

Ala Arg Leu Leu Ser Arg Gly Gly Arg Ala Ala lie Cys Gly Lys Tyr 

2930 2935 2940 

Leu Phe Asn Trp Ala val Arg Thr Lys Leu Lys Leu Thr Pro lie Ala 
2945 2950 " 2955 2960 

Ala Ala Gly Arg Leu Asp Leu Ser Gly Trp Phe Thr Ala Gly Tyr Ser 

2965 2970 2975 

Gly Gly Asp lie Tyr His Ser val Ser His Ala Arg Pro Arg Trp Phe 

2980 2985 ~ 2990 

Trp Phe Cys Leu Leu Leu Leu Ala Ala Gly val Gly lie Tyr Leu Leu 

2995 3000 3005 

Prp Asn Arg 
3010 
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<210> 8 
<211> 9595 
<212> DNA 

<213> Hepatitis C virus strain HC-J4 
<400> 8 

gccagccccc tgatgggggc gacactccac catgaatcac tcccctgtga ggaactactg 60 
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 
gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360 
ctcaaagaaa aaccaaacgt aacaccaacc gccgcccaca ggacgtcaag ttcccgggcg 420 
gtggtcagat cgttggtgga gtttacctgt tgccgcgcag gggccccagg ttgggtgtgc 480 
gcgcgactag gaaggcttcc gagcggtcgc aacctcgtgg aaggcgacaa cctatcccaa 540 
aggctcgccg acccgagggc agggcctggg ctcagcccgg gtacccttgg cccctctatg 600 
gcaatgaggg cctggggtgg gcaggatggc tcctgtcacc ccgcggctcc cggcctagtt 660 
ggggccccac ggacccccgg cgtaggtcgc gtaacttggg taaggtcatc gataccctta 720 
catgcggctt cgccgatctc atggggtaca ttccgctcgt cggcgccccc ctagggggcg 780 
ctgccagggc cttggcacac ggtgtccggg ttctggagga cggcgtgaac tatgcaacag 840 
ggaacttgcc cggttgctct ttctctatct tcctcttggc tctgctgtcc tgtttgacca 900 
tcccagcttc cgcttatgaa gtgcgcaacg tgtccgggat ataccatgtc acgaacgact 960 
gctccaactc aagcattgtg tatgaggcag cggacgtgat catgcatact cccgggtgcg 1020 
tgccctgtgt tcaggagggt aacagctccc gttgctgggt agcgctcact cccacgctcg 1080 
cggccaggaa tgccagcgtc cccactacga caatacgacg ccacgtcgac ttgctcgttg 1140 
ggacggctgc tttctgctcc gctatgtacg tgggggatct ctgcggatct attttcctcg 1200 
tctcccagct gttcaccttc tcgcctcgcc ggcatgagac agtgcaggac tgcaactgct 1260 
caatctatcc cggccatgta tcaggtcacc gcatggcttg ggatatgatg atgaactggt 1320 
cacctacaac agccctagtg gtgtcgcagt tgctccggat cccacaagct gtcgtggaca 1380 
tggtggcggg ggcccactgg ggagtcctgg cgggccttgc ctactattcc atggtaggga 1440 
actgggctaa ggttctgatt gtggcgctac tctttgccgg cgttgacggg gagacccaca 1500 
cgacggggag ggtggccggc cacaccacct ccgggttcac gtcccttttc tcatctgggg 1560 
cgtctcagaa aatccagctt gtgaatacca acggcagctg gcacatcaac aggactgccc 1620 
taaattgcaa tgactccctc caaactgggt tctttgccgc gctgttttac gcacacaagt 1680 
tcaactcgtc cgggtgcccg gagcgcatgg ccagctgccg ccccattgac tggttcgccc 1740 
aggggtgggg ccccatcacc tatactaagc ctaacagctc ggatcagagg ccttattgct 1800 
ggcattacgc gcctcgaccg tgtggtgtcg tacccgcgtc gcaggtgtgt ggtccagtgt 1860 
attgtttcac cccaagccct gttgtggtgg ggaccaccga tcgttccggt gtccctacgt 1920 
atagctgggg ggagaatgag acagacgtga tgctcctcaa caacacgcgt ccgccacaag 1980 
gcaactggtt cggctgtaca tggatgaata gtactgggtt cactaagacg tgcggaggtc 2040 
ccccgtgtaa catcgggggg gtcggtaacc gcaccttgat ctgccccacg gactgcttcc 2100 
ggaagcaccc cgaggctact tacacaaaat gtggctcggg gccctggttg acacctaggt 2160 
gcctagtaga ctacccatac aggctttggc actacccctg cactctcaat ttttccatct 2220 
ttaaggttag gatgtatgtg gggggcgtgg agcacaggct caatgccgca tgcaattgga 2280 
ctcgaggaga gcgctgtaac ttggaggaca gggataggtc agaactcagc ccgctgctgc 2340 
tgtctacaac agagtggcag atactgccct gtgctttcac caccctaccg gctttatcca 2400 
ctggtttgat ccatctccat cagaacatcg tggacgtgca atacctgtac ggtgtagggt 2460 
cagcgtttgt ctcctttgca atcaaatggg agtacatcct gttgcttttc cttctcctgg 2520 
cagacgcgcg cgtgtgtgcc tgcttgtgga tgatgctgct gatagcccag gctgaggccg 2580 
ccttagagaa cttggtggtc ctcaatgcgg cgtccgtggc cggagcgcat ggtattctct 2640 
cctttcttgt gttcttctgc gccgcctggt acattaaggg caggctggct cctggggcgg 2700 
cgtatgcttt ttatggcgta tggccgctgc tcctgctcct actggcgtta ccaccacgag 2760 
cttacgcctt ggaccgggag atggctgcat cgtgcggggg tgcggttctt gtaggtctgg 2820 
tattcttgac cttgtcacca tactacaaag tgtttctcac taggctcata tggtggttac 2880 
aatactttat caccagagcc gaggcgcaca tgcaagtgtg ggtccccccc ctcaacgttc 2940 
ggggaggccg cgatgccatc atcctcctca cgtgtgcggt tcatccagag ttaatttttg 3000 
acatcaccaa actcctgctc gccatactcg gcccgctcat ggtgctccag gctggcataa 3060 
cgagagtgcc gtacttcgtg cgcgctcaag ggctcattcg tgcatgcatg ttagtgcgaa 3120 
aagtcgccgg gggtcattat gtccaaatgg tcttcatgaa gctgggcgcg ctgacaggta 3180 
cgtacgttta taaccatctt accccactgc gggactgggc ccacgcgggc ctacgagacc 3240 
ttgcggtggc ggtagagccc gtcgtcttct ccgccatgga gaccaaggtc atcacctggg 3300 
gagcagacac cgctgcgtgt ggggacatca tcttgggtct acccgtctcc gcccgaaggg 3360 
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ggaaggagat atttttggga ccggctgata gtctcgaagg gcaagggtgg cgactccttg 3420 

cgcccatcac ggcctactcc caacaaacgc ggggcgtact tggttgcatc atcactagcc 3480 

tcacaggccg ggacaagaac caggtcgaag gggaggttca agtggtttct accgcaacac 3540 

aatctttcct ggcgacctgc atcaacggcg tgtgctggac tgtctaccat ggcgctggct 3600 

cgaagaccct agccggtcca aaaggtccaa tcacccaaat gtacaccaat gtagacctgg 3660 

acctcgtcgg ctggcaggcg ccccccgggg cgcgctccat gacaccatgc agctgtggca 3720 

gctcggacct ttacttggtc acgagacatg ctgatgtcat tccggtgcgc cggcgaggcg 3780 

acagcagggg aagtctactc tcccccaggc ccgtctccta cctgaaaggc tcctcgggtg 3840 

gtccattgct ttgcccttcg gggcacgtcg tgggcgtctt ccgggctgct gtgtgcaccc 3900 

ggggggtcgc gaaggcggtg gacttcatac ccgttgagtc tatggaaact accatgcggt 3960 

ctccggtctt cacagacaac tcaacccccc cggctgtacc gcagacattc caagtggcac 4020 

atctgcacgc tcctactggc agcggcaaga gcaccaaagt gccggctgcg tatgcagccc 4080 

aagggtacaa ggtgctcgtc ctgaacccgt ccgttgccgc caccttaggg tttggggcgt 4140 

atatgtccaa ggcacacggt atcgacccta acatcagaac tggggtaagg accattacca 4200 

cgggcggctc cattacgtac tccacctatg gcaagttcct tgccgacggt ggctgttctg 4260 

ggggcgccta tgacatcata atatgtgatg agtgccactc aactgactcg actaccatct 4320 

tgggcatcgg cacagtcctg gaccaagcgg agacggctgg agcgcggctc gtcgtgctcg 4380 

ccaccgctac acctccggga tcggttaccg tgccacaccc caatatcgag gaaataggcc 4440 

tgtccaacaa tggagagatc cccttctatg gcaaagccat ccccattgag gccatcaagg 4500 

gggggaggca tctcattttc tgccattcca agaagaaatg tgacgagctc gccgcaaagc 4560 

tgacaggcct cggactgaac gctgtagcat attaccgggg ccttgatgtg tccgtcatac 4620 

cgcctatcgg agacgtcgtt gtcgtggcaa cagacgctct aatgacgggt ttcaccggcg 4680 

attttgactc agtgatcgac tgcaatacat gtgtcaccca gacagtcgac ttcagcttgg 4740 

atcccacctt caccattgag acgacgaccg tgccccaaga cgcggtgtcg cgctcgcaac 4800 

ggcgaggtag aactggcagg ggtaggagtg gcatctacag gtttgtgact ccaggagaac 4860 

ggccctcggg catgttcgat tcttcggtcc tgtgtgagtg ctatgacgcg ggctgtgctt 4920 

ggtatgagct cacgcccgct gagacctcgg ttaggttgcg ggcttaccta aatacaccag 4980 

ggttgcccgt ctgccaggac catctggagt tctgggagag cgtcttcaca ggcctcaccc 5040 

acatagatgc ccacttcctg tcccagacta aacaggcagg agacaacttt ccttacctgg 5100 

tggcatatca agctacagtg tgcgccaggg ctcaagctcc acctccatcg tgggaccaaa 5160 

tgtggaagtg tctcatacgg ctgaaaccta cactgcacgg gccaacaccc ctgctgtata 5220 

ggctaggagc cgtccaaaat gaggtcatcc tcacacaccc cataactaaa tacatcatgg 5280 

catgcatgtc ggctgacctg gaggtcgtca ctagcacctg ggtgctggta ggcggagtcc 5340 

ttgcagcttt ggccgcatac tgcctgacga caggcagtgt ggtcattgtg ggcaggatca 5400 

tcttgtccgg gaagccagct gtcgttcccg acagggaagt cctctaccag gagttcgatg 5460 

agatggaaga gtgtgcctca caacttcctt acatcgagca gggaatgcag ctcgccgagc 5520 

aattcaagca aaaggcgctc gggttgttgc aaacggccac caagcaagcg gaggctgctg 5580 

ctcccgtggt ggagtccaag tggcgagccc ttgagacctt ctgggcgaag cacatgtgga 5640 

atttcatcag cggaatacag tacctagcag gcttatccac tctgcctgga aaccccgcga 5700 

tagcatcatt gatggcattt acagcttcta tcactagccc gctcaccacc caaaacaccc 5760 

tcctgtttaa catcttgggg ggatgggtgg ctgcccaact cgctcctccc agcgctgcgt 5820 

cagctttcgt gggcgccggc atcgccggag cggctgttgg cagcataggc cttgggaagg 5880 

tgctcgtgga catcttggcg ggctatgggg caggggtagc cggcgcactc gtggccttta 5940 

aggtcatgag cggcgaggtg ccctccaccg aggacctggt caacttactc cctgccatcc 6000 

tctctcctgg tgccctggtc gtcggggtcg tgtgcgcagc aatactgcgt cggcacgtgg 6060 

gcccgggaga gggggctgtg cagtggatga accggctgat agcgttcgct tcgcggggta 6120 

accacgtctc ccctacgcac tatgtgcctg agagcgacgc tgcagcacgt gtcactcaga 6180 

tcctctctag ccttaccatc actcaactgc tgaagcggct ccaccagtgg attaatgagg 6240 

actgctctac gccatgctcc ggctcgtggc taagggatgt ttgggattgg atatgcacgg 6300 

tgttgactga cttcaagacc tggctccagt ccaaactcct gccgcggtta ccgggagtcc 6360 

ctttcctgtc atgccaacgc gggtacaagg gagtctggcg gggggacggc atcatgcaaa 6420 

ccacctgccc atgcggagca cagatcgccg gacatgtcaa aaacggttcc atgaggatcg 6480 

tagggcctag aacctgcagc aacacgtggc acggaacgtt ccccatcaac gcatacacca 6540 

cgggaccttg cacaccctcc ccggcgccca actattccag ggcgctatgg cgggtggctg 6600 

ctgaggagta cgtggaggtt acgcgtgtgg gggatttcca ctacgtgacg ggcatgacca 6660 

ctgacaacgt aaagtgccca tgccaggttc cggcccccga attcttcacg gaggtggatg 6720 

gagtgcggtt gcacaggtac gctccggcgt gcaaacctct tctacgggag gacgtcacgt 6780 

tccaggtcgg gctcaaccaa tacttggtcg ggtcgcagct cccatgcgag cccgaaccgg 6840 

acgtaacagt gcttacttcc atgctcaccg atccctccca cattacagca gagacggcta 6900 

agcgtaggct ggctagaggg tctcccccct ctttagccag ctcatcagct agccagttgt 6960 

ctgcgccttc tttgaaggcg acatgcacta cccaccatga ctccccggac gctgacctca 7020 

tcgaggccaa cctcttgtgg cggcaggaga tgggcggaaa catcactcgc gtggagtcag 7080 

agaataaggt agtaattctg gactctttcg aaccgcttca cgcggagggg gatgagaggg 7140 
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agatatccgt 
tatgggcacg 
tccctccggt 
cacggagaaa 
tcgccactaa 
cccttcctga 
ccatgccccc 
ccgtgagtga 
gcgccctgat 
actctttgct 
ggcagaagaa 
tcaaggagat 
cctgcaagct 
tccggaacct 
aagacactga 
*aaccagagaa 
gtgtatgcga 
gctcctcata 
ggaaatcaaa 
tcactgagag 
aggccagaca 
actcaaaagg 
gctgcggtaa 
tccaggactg 
gaacccagga 
ccccccccgg 
ccaatgtgtc 
ccaccacccc 
ggctaggcaa 
actttttctc 
acggggcttg 
gtcttagcgc 
gcctcaggaa 
gcgctaagct 
gggcagtaag 
ctggctggtt 
gaccccgctg 
tccccaaccg 
tttttttttt 
tttttccctt 
ccgtgagccg 



cgcggcggag 
cccggactac 
ggtacacgga 
gaggacggtt 
gaccttcggt 
cctggcctcc 
ccttgaaggg 
ggaggctagt 
cacgccatgc 
gcgtcaccac 
ggtcaccttt 
gaaggcgaag 
gacgccccca 
atccagcagg 
aacaccaatt 
gggaggccgc 
gaagatggcc 
cggatttcaa 
gaaatgccct 
tgacattcgt 
ggccataagg 
gcagaactgc 
taccctcaca 
cacgatgctc 
ggatgcggcg 
ggatccgccc 
agtcgcgcac 
ccttgcacgg 
tatcatcatg 
catccttcta 
ctactccatt 
atttacactc 
acttggggta 
actgtcccag 
gaccaagctt 
cgtcgctggt 
gtttccgttg 
atgaacgggg 
tttttttttt 
ctttaatggt 
catgactgca 



atcctgcgaa 
aatcctccac 
tgcccattgc 
gtcctgacag 
agctccggat 
gacgacggtg 
gagccggggg 
gaggatgtcg 
gctgcggagg 
aacatggtct 
gacagattgc 
gcgtccacag 
cattcggcca 
gccgttaacc 
gacaccacca 
aagccagctc 
ctttacgacg 
tactccccca 
atgggcttct 
gttgaggagt 
tcgctcacag 
ggttatcgcc 
tgttacttga 
gtgaacggag 
gccctacgag 
caaccagaat 
gatgcatctg 
gctgcgtggg 
tatgcgccca 
gctcaagagc 
gagccacttg 
cacagttact 
ccacccttgc 

ggggggaggg 

aaactcactc 
tacagcgggg 
tgcctactcc 
agctaaccac 
tctttttttt 
ggctccatct 
gagagtgctg 
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aatccaggaa gttcccctca gcgttgccca 7200 

tgctagagtc ctggaaggac ccggactacg 7260 

cacctaccaa ggctcctcca ataccacctc 7320 

aatccaatgt gtcttctgcc ttggcggagc 7380 

cgtcggccgt tgatagcggc acggcgaccg 7440 

acaaaggatc cgacgttgag tcgtactcct 7500 

accccgatct cagcgacggg tcttggtcta 7560 

tctgctgctc aatgtcctat acgtggacag 7620 

aaagtaagct gcccatcaac ccgttgagca 7680 

acgccacaac atcccgcagc gcaagcctcc 7740 

aagtcctgga tgatcattac cgggacgtac 7800 

ttaaggctaa gcttctatct atagaggagg 7860 

aatccaaatt tggctatggg gcaaaggacg 7920 

acatccgctc cgtgtgggag gacttgctgg 7980 

tcatggcaaa aagtgaggtt ttctgcgtcc 8040 

gccttatcgt attcccagac ctgggagttc 8100 

tggtctccac ccttcctcag gccgtgatgg 8160 

agcagcgggt cgagttcctg gtgaatacct 8220 

catatgacac ccgctgtttt gactcaacgg 8280 

caatttacca atgttgtgac ttggcccccg 8340 

agcggcttta catcgggggt cccctgacta 8400 

ggtgccgcgc aagtggcgtg ctgacgacta 8460 

aggccactgc agcctgtcga gctgcaaagc 8520 

acgaccttgt cgttatctgt gaaagcgcgg 8580 

ccttcacgga ggctatgact aggtattccg 8640 

acgacctgga gctgataaca tcatgttcct 8700 

gcaaaagggt atactacctc acccgtgacc 8760 

agacagctag acacactcca atcaactctt 8820 

ccctatgggc aaggatgatt ctgatgactc 8880 

aacttgaaaa agccctggat tgtcagatct 8940 

acctacctca gatcattgaa cgactccatg 9000 

ctccaggtga gatcaatagg gtggcttcat 9060 

gaacctggag acatcgggcc agaagtgtcc 9120 

ccgccacttg tggcagatac ctctttaact 9180 

caatcccggc cgcgtcccag ctggacttgt 9240 

gagacatata tcacagcctg tctcgtgccc 9300 

tactttctgt aggggtaggc atttacctgc 9360 

tccaggcctt aagccatttc ctgttttttt 9420 

tttctttcct ttccttcttt ttttcctttc 9480 

tagccctagt cacggctagc tgtgaaaggt 9540 

atactggcct ctctgcagat catgt 9595 
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Ser lie Phe Lys Val Arg Met Tyr Val Gly Gly val Glu His Arg Leu 
625 630 635 640 

Asn Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asn Leu Glu Asp 

645 650 655 

Arq Asp Arq Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp 

y 660 665 670 

Gin lie Leu Pro Cys Ala Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly 

675 680 685 

Leu lie His Leu His Gin Asn lie Val Asp Val Gin Tyr Leu Tyr Gly 

690 695 700 

val Gly Ser Ala Phe Val Ser Phe Ala lie Lys Trp Glu Tyr lie Leu 
705 710 715 720 

Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys Ala Cys Leu Trp 

725 730 735 

Met Met Leu Leu lie Ala Gin Ala Glu Ala Ala Leu Glu Asn Leu Val 

740 745 750 

val Leu Asn Ala Ala Ser val Ala Gly Ala His Gly lie Leu Ser Phe 

755 760 765 

Leu val Phe Phe Cys Ala Ala Trp Tyr lie Lys Gly Arg Leu Ala Pro 

770 775 780 

Gly Ala Ala Tyr Ala Phe Tyr Gly val Trp Pro Leu Leu Leu Leu Leu 
785 790 * 795 800 

Leu Ala Leu Pro Pro Arg Ala Tyr Ala Leu Asp Arg Glu Met Ala Ala 

805 810 815 

Ser Cys Gly Gly Ala Val Leu val Gly Leu val Phe Leu Thr Leu Ser 

820 825 830 

Pro Tyr Tyr Lys val Phe Leu Thr Arg Leu lie Trp Trp Leu Gin Tyr 

835 840 845 

Phe lie Thr Arg Ala Glu Ala His Met Gin val Trp Val Pro Pro Leu 

850 855 860 

Asn val Arg Gly Gly Arg Asp Ala lie lie Leu Leu Thr Cys Ala val 
865 870 875 880 

His Pro Glu Leu lie Phe Asp lie Thr Lys Leu Leu Leu Ala lie Leu 

885 890 895 

Gly Pro Leu Met Val Leu Gin Ala Gly lie Thr Arg val Pro Tyr Phe 

900 905 910 

Val Arg Ala Gin Gly Leu lie Arg Ala Cys Met Leu val Arg Lys val 

915 920 925 

Ala Gly Gly His Tyr Val Gin Met val Phe Met Lys Leu Gly Ala Leu 

930 935 940 

Thr Gly Thr Tyr val Tyr Asn His Leu Thr Pro Leu Arg Asp Trp Ala 
945 950 955 960 

His Ala Gly Leu Arg Asp Leu Ala Val Ala Val Glu Pro val Val Phe 

965 970 975 

Ser Ala Met Glu Thr Lys val lie Thr Trp Gly Ala Asp Thr Ala Ala 

980 985 990 

Cys Gly Asp lie lie Leu Gly Leu Pro val Ser Ala Arg Arg Gly Lys 

995 1000 1005 

Glu lie Phe Leu Gly Pro Ala Asp Ser Leu Glu Gly Gin Gly Trp Arg 

1010 1015 1020 

Leu Leu Ala Pro lie Thr Ala Tyr Ser Gin Gin Thr Arg Gly val Leu 
1025 1030 1035 1040 

Gly Cys lie lie Thr Ser Leu Thr Gly Arg Asp Lys Asn Gin val Glu 

1045 1050 1055 

Gly Glu val Gin Val Val Ser Thr Ala Thr Gin Ser Phe Leu Ala Thr 

1060 1065 1070 

Cys lie Asn Gly val Cys Trp Thr val Tyr His Gly Ala Gly Ser Lys 

1075 1080 1085 

Thr Leu Ala Gly Pro Lys Gly Pro lie Thr Gin Met Tyr Thr Asn val 

1090 1095 1100 

Asp Leu Asp Leu val Gly Trp Gin Ala Pro Pro Gly Ala Arg Ser Met 
1105 1110 1115 1120 

Thr Pro Cys Ser Cys Gly Ser Ser Asp Leu Tyr Leu val Thr Arg His 
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1125 1130 1135 

Ala Asp val lie Pro val Arg Arg Arg Gly Asp Ser Arg Gly Ser Leu 

1140 1145 1150 

Leu Ser Pro Arg Pro Val Ser Tyr Leu Lys Gly Ser Ser Gly Gly Pro 

1155 1160 1165 

Leu Leu Cys Pro Ser Gly His val val Gly val Phe Arg Ala Ala val 

1170 1175 1180 

Cys Thr Arg Gly val Ala Lys Ala Val Asp Phe lie Pro Val Glu Ser 
1185 " 1190 1195 1200 

Met Glu Thr Thr Met Arg Ser Pro Val Phe Thr Asp Asn Ser Thr Pro 

1205 1210 1215 

Pro Ala val Pro Gin Thr Phe Gin Val Ala His Leu His Ala Pro Thr 

1220 1225 1230 

Gly Ser Gly Lys Ser Thr Lys Val Pro Ala Ala Tyr Ala Ala Gin Gly 

1235 1240 1245 

Tyr Lys Val Leu Val Leu Asn Pro Ser val Ala Ala Thr Leu Gly Phe 

1250 1255 1260 

Gly Ala Tyr Met Ser Lys Ala His Gly lie Asp Pro Asn lie Arg Thr 
1265 1270 1275 1280 

Gly Val Arg Thr lie Thr Thr Gly Gly Ser lie Thr Tyr Ser Thr Tyr 

1285 1290 1295 

Gly Lys Phe Leu Ala Asp Gly Gly Cys Ser Gly Gly Ala Tyr Asp lie 

1300 1305 1310 

lie lie Cys Asp Glu Cys His Ser Thr Asp Ser Thr Thr lie Leu Gly 

1315 1320 1325 

lie Gly Thr val Leu Asp Gin Ala Glu Thr Ala Gly Ala Arg Leu Val 

1330 1335 1340 

val Leu Ala Thr Ala Thr Pro Pro Gly Ser val Thr val Pro His Pro 
1345 1350 1355 1360 

Asn lie Glu Glu lie Gly Leu Ser Asn Asn Gly Glu lie Pro Phe Tyr 

1365 1370 1375 

Gly Lys Ala lie Pro lie Glu Ala lie Lys Gly Gly Arg His Leu lie 

1380 1385 1390 

Phe Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala Ala Lys Leu Thr 

1395 1400 1405 

Gly Leu Gly Leu Asn Ala val Ala Tyr Tyr Arg Gly Leu Asp val Ser 

1410 1415 1420 

Val lie Pro Pro lie Gly Asp val Val val val Ala Thr Asp Ala Leu 
1425 1430 1435 1440 

Met Thr Gly Phe Thr Gly Asp Phe Asp Ser val lie Asp Cys Asn Thr 

1445 1450 1455 

Cys Val Thr Gin Thr val Asp Phe Ser Leu Asp Pro Thr Phe Thr lie 

1460 1465 1470 

Glu Thr Thr Thr val Pro Gin Asp Ala val Ser Arg Ser Gin Arg Arg 

1475 1480 1485 

Gly Arg Thr Gly Arg Gly Arg Ser Gly lie Tyr Arg Phe val Thr Pro 

1490 1495 1500 

Gly Glu Arg Pro Ser Gly Met Phe Asp Ser Ser Val Leu Cys Glu Cys 
1505 1510 1515 1520 

Tyr Asp Ala Gly Cys Ala Trp Tyr Glu Leu Thr Pro Ala Glu Thr Ser 

1525 1530 1535 

val Arg Leu Arg Ala Tyr Leu Asn Thr Pro Gly Leu Pro val Cys Gin 

1540 1545 1550 

Asp His Leu Glu Phe Trp Glu Ser val Phe Thr Gly Leu Thr His lie 

1555 1560 1565 

Asp Ala His Phe Leu Ser Gin Thr Lys Gin Ala Gly Asp Asn Phe Pro 

1570 1575 1580 

Tyr Leu val Ala Tyr Gin Ala Thr Val Cys Ala Arg Ala Gin Ala Pro 
1585 1590 1595 1600 

Pro Pro Ser Trp Asp Gin Met Trp Lys Cys Leu lie Arg Leu Lys Pro 

1605 1610 1615 

Thr Leu His Gly Pro Thr Pro Leu Leu Tyr Arg Leu Gly Ala Val Gin 
1620 1625 1630 
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Asn Glu val lie Leu Thr His Pro lie Thr Lys Tyr lie Met Ala Cys 

1635 1640 1645 

Met Ser Ala Asp Leu Glu val val Thr Ser Thr Trp Val Leu val Gly 

1650 1655 1660 

Gly val Leu Ala Ala Leu Ala Ala Tyr Cys Leu Thr Thr Gly Ser Val 
1665 1670 1675 1680 

val lie Val Gly Arg lie lie Leu Ser Gly Lys Pro Ala Val val Pro 

1685 1690 1695 

Asp Arg Glu val Leu Tyr Gin Glu Phe Asp Glu Met Glu Glu Cys Ala 

1700 1705 1710 

Ser Gin Leu Pro Tyr lie Glu Gin Gly Met Gin Leu Ala Glu Gin Phe 

1715 1720 1725 

Lys Gin Lys Ala Leu Gly Leu Leu Gin Thr Ala Thr Lys Gin Ala Glu 

1730 1735 1740 

Ala Ala Ala Pro val val Glu Ser Lys Trp Arg Ala Leu Glu Thr Phe 
1745 1750 * 1755 1760 

Trp Ala Lys His Met Trp Asn Phe lie Ser Gly lie Gin Tyr Leu Ala 

1765 1770 1775 

Gly Leu Ser Thr Leu Pro Gly Asn Pro Ala lie Ala Ser Leu Met Ala 

1780 1785 1790 

Phe Thr Ala Ser lie Thr Ser Pro Leu Thr Thr Gin Asn Thr Leu Leu 

1795 1800 1805 

Phe Asn lie Leu Gly Gly Trp Val Ala Ala Gin Leu Ala Pro Pro Ser 

1810 1815 1820 

Ala Ala Ser Ala Phe Val Gly Ala Gly lie Ala Gly Ala Ala val Gly 
1825 1830 1835 1840 

Ser lie Gly Leu Gly Lys val Leu Val Asp lie Leu Ala Gly Tyr Gly 

1845 1850 1855 

Ala Gly val Ala Gly Ala Leu val Ala Phe Lys val Met Ser Gly Glu 

1860 1865 1870 

Val Pro Ser Thr Glu Asp Leu val Asn Leu Leu Pro Ala lie Leu Ser 

1875 1880 1885 

Pro Gly Ala Leu val val Gly val Val Cys Ala Ala lie Leu Arg Arg 

1890 1895 1900 

His val Gly Pro Gly Glu Gly Ala val Gin Trp Met Asn Arg Leu lie 
1905 " 1910 1915 1920 

Ala Phe Ala Ser Arg Gly Asn His val Ser Pro Thr His Tyr val Pro 

1925 1930 1935 

Glu Ser Asp Ala Ala Ala Arg Val Thr Gin lie Leu Ser Ser Leu Thr 

1940 ~ 1945 1950 

lie Thr Gin Leu Leu Lys Arg Leu His Gin Trp lie Asn Glu Asp Cys 

1955 1960 1965 

Ser Thr Pro Cys Ser Gly Ser Trp Leu Arg Asp val Trp Asp Trp lie 

1970 1975 1980 

Cys Thr Val Leu Thr Asp Phe Lys Thr Trp Leu Gin Ser Lys Leu Leu 
1985 1990 1995 2000 

Pro Arg Leu Pro Gly val Pro Phe Leu Ser Cys Gin Arg Gly Tyr Lys 

2005 2010 2015 

Gly Val Trp Arg Gly Asp Gly lie Met Gin Thr Thr Cys Pro Cys Gly 

2020 2025 2030 

Ala Gin lie Ala Gly His val Lys Asn Gly Ser Met Arg lie val Gly 

2035 * 2040 2045 

Pro Arg Thr Cys Ser Asn Thr Trp His Gly Thr Phe Pro lie Asn Ala 

2050 ' 2055 2060 

Tyr Thr Thr Gly Pro Cys Thr Pro Ser Pro Ala Pro Asn Tyr Ser Arg 
2065 2070 2075 2080 

Ala Leu Trp Arg val Ala Ala Glu Glu Tyr val Glu val Thr Arg val 

2085 2090 2095 

Gly Asp Phe His Tyr val Thr Gly Met Thr Thr Asp Asn Val Lys Cys 

2100 2105 2110 

Pro Cys Gin val Pro Ala Pro Glu Phe Phe Thr Glu Val Asp Gly val 

2115 2120 2125 

Arg Leu His Arg Tyr Ala Pro Ala Cys Lys Pro Leu Leu Arg Glu Asp 
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2130 2135 2140 

val Thr Phe Gin val Gly Leu Asn Gin Tyr Leu val Gly Ser Gin Leu 
2145 2150 2155 2160 

Pro Cys Glu Pro Glu Pro Asp val Thr val Leu Thr Ser Met Leu Thr 

2165 2170 2175 

Asp Pro Ser His lie Thr Ala Glu Thr Ala Lys Arg Arg Leu Ala Arg 

2180 2185 2190 

Gly Ser Pro Pro Ser Leu Ala Ser Ser Ser Ala Ser Gin Leu Ser Ala 

2195 2200 2205 

Pro Ser Leu Lys Ala Thr Cys Thr Thr His His Asp Ser Pro Asp Ala 

2210 2215 2220 

Asp Leu lie Glu Ala Asn Leu Leu Trp Arg Gin Glu Met Gly Gly Asn 
2225 2230 2235 2240 

lie Thr Arg val Glu Ser Glu Asn Lys val val lie Leu Asp Ser Phe 

2245 2250 2255 

Glu Pro Leu His Ala Glu Gly Asp Glu Arg Glu lie Ser val Ala Ala 

2260 2265 2270 

Glu lie Leu Arg Lys Ser Arg Lys Phe Pro Ser Ala Leu Pro lie Trp 

2275 2280 2285 

Ala Arg Pro Asp Tyr Asn Pro Pro Leu Leu Glu Ser Trp Lys Asp Pro 

2290 2295 2300 

Asp Tyr val Pro Pro val Val His Gly Cys Pro Leu Pro Pro Thr Lys 
2305 2310 2315 2320 

Ala Pro Pro lie Pro Pro Pro Arg Arg Lys Arg Thr val Val Leu Thr 

2325 ~ 2330 2335 

Glu Ser Asn Val Ser Ser Ala Leu Ala Glu Leu Ala Thr Lys Thr Phe 

2340 2345 2350 

Gly Ser Ser Gly Ser Ser Ala Val Asp Ser Gly Thr Ala Thr Ala Leu 

2355 2360 2365 

Pro Asp Leu Ala Ser Asp Asp Gly Asp Lys Gly Ser Asp Val Glu Ser 

2370 2375 2380 

Tyr Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu 
2385 2390 " 2395 2400 

Ser Asp Gly Ser Trp Ser Thr Val Ser Glu Glu Ala Ser Glu Asp val 

2405 2410 2415 

val Cys Cys Ser Met Ser Tyr Thr Trp Thr Gly Ala Leu lie Thr Pro 

2420 2425 2430 

Cys Ala Ala Glu Glu Ser Lys Leu Pro lie Asn Pro Leu Ser Asn Ser 

2435 2440 2445 

Leu Leu Arg His His Asn Met val Tyr Ala Thr Thr Ser Arg Ser Ala 

2450 2455 2460 

Ser Leu Arg Gin Lys Lys Val Thr Phe Asp Arg Leu Gin Val Leu Asp 
2465 2470 2475 2480 

Asp His Tyr Arg Asp val Leu Lys Glu Met Lys Ala Lys Ala Ser Thr 

2485 2490 2495 

val Lys Ala Lys Leu Leu Ser lie Glu Glu Ala Cys Lys Leu Thr Pro 

2500 2505 2510 

Pro His Ser Ala Lys Ser Lys Phe Gly Tyr Gly Ala Lys Asp val Arg 

2515 2520 2525 

Asn Leu Ser Ser Arg Ala val Asn His lie Arg Ser val Trp Glu Asp 

2530 " 2535 2540 

Leu Leu Glu Asp Thr Glu Thr Pro lie Asp Thr Thr lie Met Ala Lys 
2545 2550 2555 2560 

Ser Glu val Phe Cys Val Gin Pro Glu Lys Gly Gly Arg Lys Pro Ala 

2565 2570 2575 

Arg Leu lie val Phe Pro Asp Leu Gly val Arg Val Cys Glu Lys Met 

2580 2585 2590 

Ala Leu Tyr Asp val val Ser Thr Leu Pro Gin Ala val Met Gly Ser 

2595 2600 2605 

Ser Tyr Gly Phe Gin Tyr Ser Pro Lys Gin Arg val Glu Phe Leu Val 

2610 2615 ' 2620 

Asn Thr Trp Lys Ser Lys Lys Cys Pro Met Gly Phe Ser Tyr Asp Thr 
2625 2630 2635 2640 
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Arq Cys Phe Asp Ser Thr Val Thr Glu Ser Asp He Arg val Glu G"lu 

2645 2650 2655 

Ser lie Tyr Gin Cys Cys Asp Leu Ala Pro Glu Ala Arg Gin Ala lie 

2660 2665 2670 

Arg Ser Leu Thr Glu Arg Leu Tyr lie Gly Gly Pro Leu Thr Asn Ser 

2675 " 2680 2685 

Lys Gly Gin Asn Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly val Leu 

2690 2695 2700 

Thr Thr Ser Cys Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Thr Ala 
2705 ' 2710 2715 2720 

Ala Cys Arg Ala Ala Lys Leu Gin Asp Cys Thr Met Leu val Asn Gly 

2725 2730 2735 

Asp Asp Leu val Val lie Cys Glu Ser Ala Gly Thr Gin Glu Asp Ala 

2740 2745 2750 

Ala Ala Leu Arg Ala Phe Thr Glu Ala Met Thr Arg Tyr Ser Ala Pro 

2755 2760 2765 

pro Gly Asp Pro Pro Gin Pro Glu Tyr Asp Leu Glu Leu lie Thr ser 

2770 2775 2780 

Cys Ser ser Asn val Ser val Ala His Asp Ala Ser Gly Lys Arg val 
2785 2790 2795 2800 

Tyr Tyr Leu Thr Arg Asp Pro Thr Thr Pro Leu Ala Arg Ala Ala Trp 

2805 2810 2815 

Glu Thr Ala Arg His Thr Pro lie Asn Ser Trp Leu Gly Asn lie lie 

2820 2825 2830 

Met Tyr Ala Pro Thr Leu Trp Ala Arg Met lie Leu Met Thr His Phe 

2835 2840 2845 

Phe Ser lie Leu Leu Ala Gin Glu Gin Leu Glu Lys Ala Leu Asp Cys 

2850 2855 2860 

Gin lie Tyr Gly Ala Cys Tyr Ser lie Glu Pro Leu Asp Leu Pro Gin 
2865 2870 2875 2880 

lie lie Glu Arg Leu His Gly Leu Ser Ala Phe Thr Leu His Ser Tyr 

2885 2890 2895 

Ser Pro Gly Glu lie Asn Arg Val Ala Ser Cys Leu Arg Lys Leu Gly 

2900 " 2905 2910 

val Pro Pro Leu Arg Thr Trp Arg His Arg Ala Arg Ser val Arg Ala 

2915 2920 2925 

Lys Leu Leu Ser Gin Gly Gly Arg Ala Ala Thr Cys Gly Arg Tyr Leu 

2930 2935 2940 

Phe Asn Trp Ala val Arg Thr Lys Leu Lys Leu Thr Pro lie Pro Ala 
2945 2950 2955 2960 

Ala Ser Gin Leu Asp Leu Ser Gly Trp Phe val Ala Gly Tyr Ser Gly 

2965 2970 2975 

Gly Asp lie Tyr His Ser Leu Ser Arg Ala Arg Pro Arg Trp Phe Pro 

2980 2985 " 2990 

Leu Cys Leu Leu Leu Leu Ser val Gly val Gly lie Tyr Leu Leu Pro 
2995 3000 3005 

Asn Arg 
3010 

<210> 10 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic primer 
<400> 10 

cctagcaggg cgtgggggat ttcc 

<210> 11 
<211> 24 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic primer 
<400> 11 

aggtctgcgt ccttggtagt gacc 

<210> 12 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic primer 
<400> 12 

ggatttcccc tgcccgtctg 

<210> 13 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic primer 
<400> 13 

ccccggtctt ccctacagtg 
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